MOD-</) CONVERGENCE AND PRECISE DEVIATIONS 

VALENTIN FERAY, PIERRE-LOIC MELIOT, AND ASHKAN NIKEGHBALI 

Abstract. In this paper, we use the framework of mod-(^ convergence to prove precise 
large or moderate deviations for quite general sequences of random variables (X„)„gN- 
The random variables considered can be lattice or non-lattice distributed, and single or 
multi-dimensional; and one obtains precise estimates of the fluctuations P[X„ G tnB], 
instead of the usual estimates for the rate of exponential decay log(P[X„ e tnB]). The 
four first sections of the article are devoted to a proof of these abstract results. We 
fT^ then propose new examples covered by this theory and coming from various areas of 

T— I mathematics: classical probability (multi-dimensional random walks), number theory 

^^ (statistics of additive arithmetic functions), combinatorics (statistics of random permu- 

^ ^ tations) , random matrix theory (characteristic polynomials of random matrices in com- 

pact Lie groups) , graph theory (number of subgraphs in a random Erdos-Renyi graph) , 
and non-commutative probability (asymptotics of random character values of symmet- 
ric groups). In particular, we complete our theory of precise deviations by a concrete 
^D method of cumulants and dependency graphs, which applies to many examples of sums 

^"^ of "weakly dependent" random variables. 
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1. Introduction 

The notion of mod-(/) convergence has been studied in [JKNll, DKNll, KNIO, KN12, 
BKN13], essentially in connection with problems from number theory, random matrix 
theory and probability theory. The main idea was to look for a natural renormalization 
of the characteristic functions of random variables which do not converge in law (instead 
of a renormalization of the random variables themselves). After this renormalization, the 
sequence of characteristic functions converges to some non-trivial limiting function. We 
first recall the definition of mod-0 convergence, but in the more restrictive case (but which 
is the framework for precise large deviations) where the moment generating functions are 
defined on an open neighborhood of 0. 

Definition 1.1. Let {Xn)n&N be a sequence of random variables, and ipn{z) = E,[e^^"] be 
their moment generating functions, which we assume to all exist in a strip 

Sc = {z, — c < Re^; < c}, 

with c positive real number. We assume that there exists an infinitely divisible distribution 
(f) with moment generating function J e^^(f){dx) = exp{ri{z)), and an analytic function 
ip{z) that does not vanish on the real part of Sc, such that locally uniformly in z & Sc, 

exp(-t„ r]{z)) ipniz) -^ ipiz), (1) 

where (tn)nGN 'is some sequence going to +oo. We then say that (Xn)n&N converges mod-0, 
with parameters (tn)n6N (ind limiting function ip. In the following we denote ipn{z) the 
left-hand side of (1). 

Remark 1.2. These hypotheses imply that the X^s and (p have moments of all order; in 
particular, (p cannot be any infinitely divisible distribution (for instance the Cauchy dis- 
tribution is excluded). In the examples, we shall see that the most common situations are 
(p = J\f{0, a^) (centered Gaussian law with variance cr^) corresponding to the framework 
of the so-called mod-Gaussian convergence and (p = V{\) (Poisson law with parameter A) 
corresponding to the case of mod-Poisson convergence. In the sequel we shall use quite 
often the fact that the local uniform convergence of analytic functions imply those of their 
derivatives by Cauchy formula. 

Remark 1.3. It is to avoid the restrictions mentioned in the previous remark that the 
theory of mod-0 convergence was initially developed with characteristic functions rather 
than moment generating functions. There are indeed many examples for instance of niod- 
Cauchy convergence (see e.g. [DKNll, KNNIO]) or more generally of mod-0 convergence 
where is a stable distribution. For lattice examples of situations where the moment 
generating method would not work the reader can look at [BKN13]. 

It is immediate to see that mod-0 convergence implies a central limit theorem if the 
sequence of parameters t„ goes to infinity (see the remark after Proposition 2.8). But 
in fact there is much more information encoded in mod-0 convergence than merely the 
central limit theorem. Indeed the works [JKNll, DKNll, KNIO, KN12, BKN13] tend to 
illustrate the fact that mod-0 convergence appears as a natural extension of the framework 
of sums of independent random variables in the sense that many interesting asymptotic 
results that hold for sums of independent random variables can also be established for a 
sequence of random variables converging in the mod-0 sense. For instance, under some 
general extra assumptions on the convergence in (1) it is proved in [DKNll] and [KN12] 
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that one can establish local limit theorems for the random variables X„. In particular 
it is shown in [DKNll] that the framework of mod-0 convergence extends the classical 
framework of sums of i.i.d. random variables for local limit theorems (the local limit 
theorem of Stone appears as a special case of the local limit theorem in the framework 
of mod-0 convergence). On the other hand it applies also to a variety of situations 
where the random variables under consideration exhibit some dependence structure (e.g. 
the Riemann zeta function on the critical line, some probabilistic models of primes, the 
winding number for the planar Brownian motion, the characteristic polynomial of random 
matrices, finite fields L-functions, etc.). It is also shown in [BKN13] that mod-Poisson 
convergence (in fact mod-0 convergence for a lattice distribution) implies very sharp 
distributional approximation in the total variation distance (among other distances) for 
a large class of random variables. In particular it is shown that the total number of 
distinct prime divisors u{n) of an integer n chosen at random can be approximated in the 
total variation distance with an arbitrary precision by explicitly computable measures. 
Besides these quantitative aspects, mod-(/) convergence also sheds some new light on the 
nature of some conjectures in analytic number theory. Indeed it is shown in [KNIO] 
that the structure of the limiting function appearing in the moments conjecture for the 
Riemann zeta function by Keating and Snaith [KSOOb] is shared by other arithmetic 
functions and that the limiting function ip accounts for the fact that prime numbers do 
not behave independently of each other. More precisely, the limiting function ip can be 
used to measure the deviation of the true result from what the probabilistic models based 
on a naive independence assumption would predict. One should note that these naive 
probabilistic models are usually enough to predict central limit theorems for arithmetic 
functions (e.g. the naive probabilistic model made with a sum of independent Bernoulli 
random variables to predict the Erdos-Kac central limit theorem for u{n) or the stochastic 
zeta function to predict Selberg's central limit theorem for the Riemann zeta function) but 
fail to predict accurately mod-0 convergence by a factor which is contained in ip. More 
generally it seems that the limiting function encodes information about the dependence 
between the different components of a random vector: indeed it is noted in [KN12] that 
the log of the characteristic polynomial of a random unitary matrix, as a vector in M^, 
converges in the mod-Gaussian sense to a limiting function which is not the product of 
the limiting functions of each component considered individually although when properly 
normalized it converges to a Gaussian vector with independent components. But we do 
not understand yet well the nature of the limiting function in relation with the dependence 
structure of the random variables. The remark 2.5 of the present paper emphasizes another 
property of the limiting function ip: it measures the deviation of the distributions of the 
XnS from that of the infinitely divisible distribution 0. 

More generally the goal of this paper is to prove that the framework of mod-0 conver- 
gence as described in Definition 1.1 is also suitable to obtain precise (i.e. estimates of 
the probabilities without the log) large and moderate deviations results for the sequence 
{Xn)n(zfq. As Stated in Remark 2.11 our results can be viewed as an extension of the 
classical results of Bahadur-Rao and Ellis-Gartner to the more general case described in 
Definition 1.1, but sometimes with different scales or regimes since we are interested in 
events of the form P[X„ G t„S] rather than l/t„logP[X„ G t„i?]. In fact we provide an 
asymptotic expansion in powers of l/t„ whenever the convergence in Definition 1.1 is fast 
enough. Moreover we are able, at least in the non-lattice case, to establish precise large 
deviations results in a multidimensional setting. This situation requires more care since 
the geometry of the Borel set B, when considering F[Xn G tnB], will play a crucial role. 
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The arguments involved in the proofs of our large deviations results in dimension one 
are standard but they nonetheless need to be carefully adapted to the framework of 
Definition 1.1: elementary complex analysis, the method of change of probability measure 
or tilting due to Cramer or adaptations of Berry-Esseen type inequalities with smoothing 
techniques. However it should be noted that in the multidimensional case the arguments 
are more involved and we provide new Berry-Esseen type estimates for Gaussian regular 
domains or polytopes which have an interest in their own and which extend recent results 
in [BRIO]. 

Remark 1.4. One should here give much credit to Hwang who has already established some 
of our results in the lattice case in [Hwa96] using the same framework as in Definition 
1.1. The results in [Hwa96] are one dimensional, they do not cover all regimes we shall 
consider. Moreover lower order terms in the asymptotic expansion for probabilities for 
large deviations are not considered. The results and ideas of Hwang have already been 
used by Radziwill in [Rad09] to prove precise large deviations for a large class of additive 
arithmetic functions. We shall also recover these results in Section 5 as an illustration of 
the Selberg-Delange method. 

These abstract results being established, one needs to provide a large set of (new) 
examples where these results can be applied. We have thus devoted the last four sections 
of the paper to examples, from a variety of different areas. Section 5 contains what one 
would be tempted to call "known examples" in the sense that most of these examples were 
proved to satisfy mod-0 convergence in earlier works. In some of the cases the precise 
moderate or large deviations results we obtain are new. For instance this is surprisingly 
the case for the sums of independent random variables and its applications to the loss of 
symmetry of the multidimensional random walks on Z'^ conditioned to stay far away from 
the origin. The results of Section 5.5 on the characteristic polynomial of random unitary 
matrices of different types is also new and can be viewed as completing the previous results 
by Hughes, Keating and O'Connell [HKOOl] on large deviations for the characteristic 
polynomial as well as the local limit theorems obtained in [KN12, DKNll]. We also 
recover results of Radziwill [Rad09] on precise large deviations for additive arithmetic 
functions by carefully recalling the principle of the Selberg-Delange method as well as 
results by Nikeghbali and Zeindler [NZ13] on precise large deviations for the total number 
of cycles for random permutations under the general weighted probability measure as an 
illustration of the singularity analysis method. 

We also propose in Section 3.2 some techniques based on control of cumulants (in 
the mod-Gaussian framework) to establish precise moderate deviations results. Roughly 
speaking, the idea behind is that we have mod-gaussian convergence when after normal- 
ization, the second cumulant explodes, the third one converges and the remaining ones 
tend to zero. The results of Section 3.2 will be the basis to a whole body of new exam- 
ples of mod-Gaussian convergence that are introduced from Section 6. More precisely, 
in Section 6 we combine the results from Section 3.2 with dependency graphs to prove 
precise moderate deviations for the sum of partially dependent random variables, thus 
improving and strengthening previous results (all references and details are provided in 
Section 6). The idea of using bounds on cumulants to show precise deviations for a family 
of random variable with some given dependency graph is not new, but the bounds we ob- 
tain in Theorem 6.2 (and also in Theorem 6.3) are much stronger than those which were 
previously known. We then apply these results to establish precise moderate deviations 
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for subgraph counts in Erdos-Renyi random graphs (Theorem 7.1). Eventually, as a last 
application, in Section 8, we use the machinery of dependency graphs in non-commutative 
probability spaces, namely, the algebras C&{n) of the symmetric groups, all endowed with 
the restriction of a trace of the infinite symmetric group (5(oo). The technique of cumu- 
lants still works and it gives the fluctuations of random integer partitions under so-called 
central measures in the terminology of Kerov and Vershik. Thus, one obtains a central 
limit theorem and moderate deviations for the values of the random irreducible charac- 
ters of symmetric groups under these measures. Moreover, Theorem 8.10 gives an idea 
of an infinite-dimensional generalization of the notion of mod-Gaussian convergence: the 
covariance matrix gets replaced by a quadratic form on C[x], and the limiting function is 
the exponential of a trilinear form on the same space of polynomials. 

We close this introduction by giving a few examples which will guide our intuition 
throughout the paper. In these examples, it will be useful sometimes to precise the speed 
of convergence in Definition 1.1. Thus, 

Definition 1.5. We say that the sequence of random variables {Xn)n£N converges mod-ip 
at speed 0{{tn)~") if the difference of the two sides of Equation (1) can be bounded by 
Ck {tn)~^ for any z in a given compact subset K of Sc- We use the analogue definitions 
with the o(-) notation. 

Example 1.6. The first example was used in [KNNIO] to characterize the set of limiting 
functions in the setting of mod-0 convergence. Let {Yn)n(^m be a sequence of centered, 
independent and identically distributed random variables, with E[e^^] = E[e^^i] analytic 
and non-vanishing on a disc D{0, R), possibly with R = +oo. Set Sn = Yi + ■ ■ ■ + Yn- 
The cumulant generating series of Sn is 



logE[e^^"] = n logE[e^^] =n^'^ 

r=2 






which is also analytic on D(0,R). Let f > 3 be an integer such that k^^^(Y) = for 
r G [3, f — 1], and set X„ = -^j^. One has 

^«:(2)(r) 2 k^^KY) , ^ «:M(r) , 



logipniz) = n— -^-^ z" + \^ ^ + Z^ 



r=i'+l 



and locally uniformly on S/j the right-most term is bounded by -f^, so it converges locally 
uniformly to 0. Consequently, 

, , . ( .^aV\ , . /«:(-)(F) ^ 
ipniz) = exp -n " -—- (fniz) -)■ exp j — z 



that is, {Xn)n&'N converges in the mod-Gaussian sense with parameters tn = cr"^ n v , speed 
Oi^n"^^'") and limiting function tp{z) = exp(K^^\Y) z"" /v\). 

Example 1.7. More generally, let {Sn)ne'N be a sequence of real-valued centered random 
variables that admit moments of all order, and such that \K^'^\Sn)\ < (Cr)'' n""*"^^ for all 
r > 2 and for some constants C, a, b. Assume moreover that there exists an integer f > 3 
such that 

(1) K^''\Sn) = for all 3 < r < v and all neN; 
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(2) lim„^oo "^alfb'' = cr"^ and lim„_^oo ""^J+ft^ = L exist; more precisely, 
K^^\Sn) = a^n-+'' (l + o(n-'^("^))) ; «:(^)(5„) = Ivn'^+''^ (l + o(l)). 

Set X„ = _^f'^^ . The cuniulant generating series of X„ is 



711' 



r=t)+l 
r=?;+l 

where the o(l) is locally uniform. The remaining series is locally uniformly bounded in 
absolute value by 

r=v+l r=v+l ^ ' 

Hence, 

,1.-2 cr^2;^\ , , ( L 



ijjn{z) = exp ( -n" - — — j v2„(z) -^ exp ( — z" 



locally uniformly on C, so one has again mod-Gaussian convergence, with parameters 
tn = a"^ n"'^'^ and limiting function ipi^) = e^^". In the case of sums of i.i.d. variables, 
a = 1 and 6 = 0. However, this framework includes many more examples than sums of 
i.i.d. variables: in particular, in Section 6, we show that such bounds on cumulants can 
be obtained for sums of partially dependent random variables. 

Example 1.8. Denote X„ the number of disjoint cycles (including fixed points) of a random 
permutation chosen uniformly in the symmetric group ©„. Feller's coupling (c/. [ABT03, 
Chapter 1]) shows that X„ =(iaw) XliLi "^(i/*)' where Bp denotes a Bernoulli variable equal 
to 1 with probability p and to with probability 1 — p, and the Bernoulli variables are 
independent in the previous expansion. So, 

E[e^^i = n ( 1 + —^ ) = ^"""^'"'^ n T^ 

i=i V ^ / i=i e » 

where if„ = Y17=i 1 ~ ^^gn + 7 + O(^). The product in the right-hand side converges 
locally uniformly to an entire function, therefore, 

00 1 I C^-l 1 



E[e^^"]e-('=^-^)i°^"^e4'=^-i)n- 



eZ-l 



e » 



rfe^ 



1=1 

locally uniformly, i.e., one has mod-Poisson convergence with parameters t„ = logn and 
limiting function f?^- Moreover, the speed of convergence is a O(^), hence, a o{{tn)~^) 
for any integer v. 
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2. Large deviations in the case of lattice distributions 

In the theory of large deviations, the following transform on (convex) functions is known 
to play a central role (see e.g. [DZ98, §2.2]): 

Definition 2.1. The Legendre-Fenchel transform of a function 7] is defined by: 

F{x) = sup {hx — rj{h)). 



This is an involution on convex lower semi- continuous functions. 

If rj is the logarithm of the moment generating series of a random variable, then F is 
always non-negative, and the unique h maximizing hx — r]{h), if it exists, is then defined 
by the implicit equation r]'{h{x)) = x. This implies the following useful identities: 



F{x) = xh{x) — rj{h{x)) 



F'{x) = h{x) 



F"(x) 



r]"{h{x))' 



Example 2.2. li r]{z) = mz+ ^-^ (Gaussian variable with mean m and variance cr^), then 



h{x) 



X — m 



a^ 



FAf{m.,a^)ix) 



{x — my 

2a2 



whereas if ri{z) = A(e^ — 1) (Poisson law with parameter A), then 

X log I — (x — A) if X > 0, 



h{x) = log 



"PW 



X 



+ 00 



otherwise 



"Afim^cr^) 



V(\) 




+00 



Figure 1. The Legendre-Fenchel transforms of a Gaussian law and of a 
Poisson law. 



2.L Large deviations in the scale 0(t„). In this section we suppose that the X„'s 
and the infinitely divisible distribution are lattice distributed, and that has minimal 
lattice Z. This means that the XJs take values in Z, and that the characteristic function 



\u 



e^'^ (f){du) = exp(?7(i'u)) 



of the distribution has minimal period 2n. In particular, for every u G (0,27r), 
\ exp{ri{m))\ < 1, since the smallest period of the characteristic function of a Z-valued 
infinitely divisible distribution is also the smallest m > such that |0(e'")| = 1. 
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Lemma 2.3. Let X be a Z-valued random variable whose generating function (px{z) 
E[e^"^] converges absolutely in the strip Sc- For fc G Z, 

V/i e (-C, c), F[X = k] = — [ e-^'('^+^") ipxih + m) du; 

1 n Q~k(h+iu) 

^h e (0, c), F[X>k] = - y—^^f^ Vx{h + m) du. 
Proof. Since 



(^x(/i + m) = 5^P[X = A;]e^ 



-,A;(/!,+m) 

fcez 

P[X = k] e^^ is the /i;-th Fourier coefficient of the 27r-periodic and smooth function u ^-^• 
Vx{h + iu); this leads to the ffist formula. Then, assuming also /i > 0, 

P[X > A;] = ^ P[X = /] = ^ — / e~'(''+'") ^x (/i + m) du, 
i=k i=k ^ •^-'^ 

and the sum of the moduli of the functions on the right-hand side is dominated by the 

— kh 

integrable function ^_^-h ^x{h); so by Lebesgue's dominated convergence theorem, one 
can exchange the integral and the series, which yields the second equation. D 



We now work under the assumptions of Definition 1.1, and furthermore, we assume 
that the convergence is at speed 0((t„)~'"). 

Theorem 2.4. Let x be a real number in the range ofT]',,^s, h defined by the implicit 
equation r]'{h) = x. We assume tnX G N. 

(1) The following expansion holds: 
P[X„ = tnx\ = V ,^ ^{h) + ^ + ^ + . . . + ^-i^ + of 



The ttk's are rational fractions in the derivatives of rj and tp at h. More precisely, 
denote 



iw \ ,,, , iw w"^ 



^"<-'^'"i"i"^V!:s^j-"'<">y?:s^^2« 



i""^ \ / A / \\ s~^ o^kyw) 



fn{w) =^[h + -^=== exp{An{w)) = J2 



the last expansion holding in a neighborhood of zero. The coefficient a2k{w) is an 
even polynomial in w with valuation 2k and coefficients which are polynomials in 
the derivatives of ip and rj at h, and in -^tWt . Then, 



„2 



e 2 
a-k = I Oi2kiw) -^=dw, 
yzn 
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and in particular, 

ao = ip{h); 

__lilj"{h) 1 tP{h)7]^^\h)+Atlj'{h)r]^^\h) 15 ^(/i)(r7(3)(/i))2 
"^ ~ ~2 r]"{h) ^ 24 {r]"{h)y 72 {r]"{h))^ ' 

(2) Similarly, if x is a real number in the range ofri',,^^, then 



27rt„7]"ih) 1-e-H (tn) itn)'"-^ \(tny" 

where the bk 's are obtained by the same recipe as the at 's, but starting from the 
power series 

, . 1 - exp(-/i) 

9n\W) = y ^ fn{w} 

1 — exp I —h 



a/WW 



Remark 2.5. For x > rj'^O), the first term of tlie expansion 

exp(-t^F(x)) 
y/2ntnV"{h) 

is also tlie leading term in the asymptotics of P[l^„ = tnx], where (l^)ig]R_|. is the Levy 
process associated to the analytic function ri{z). Thus, the ip{h) measures the difference 
between the distribution of X„ and the distribution of Yt^ in the interval {tnri'{0),tnr]'{c)). 

Remark 2.6. If the convergence is faster than any negative power of t„, then one can 
simplify the statement of the theorem as follows: as formal power series in t„, 



^27rtnV"{h) exp(t„F(x)) P[X„ = t„x] = / /„(«;) e""^ dw 



i.e., the expansions of both sides up to any given power Oi tj^ ) agree. 

Remark 2.7. If one wants to remove the condition t„a; G N, then one can only keep the 
first term of the expansion, since tnX — \tnx\ = 0(1) in general. Thus, the first-order 
expansions of Theorem 2.4 hold for any real number x G {rj' {0) , rj' (a)) , with a remainder 
that is a 0(^^) uniform on compact subsets of (r/'fO), r/'fc)). 



Proof. With the notations of Definition 1.1, the first equation of Lemma 2.3 becomes 
P[X„ = t„x] = ^ [ e-*""('^+^") Mh + iu) du 

tnF{x) , 



271 



MOD-0 CONVERGENCE AND PRECISE DEVIATIONS 11 

We perforin the Laplace method on (2), and to this purpose we spht the integral in two 
parts. Fix S > 0, and denote qs = niaXug(_7r,7r)\(-5,5) | exp{ri{h + iu)—ri{h))\. This is strictly 
smaller than 1 (c/. Lemma 2.10 hereafter), since 



exp{rj{h + iu) - vW) = '^ ' = EQ[e'"^] 



ghX(t^) 



is the characteristic function of X under the new probability dQ(uj) = ^j^j;jqdP{u). 

As a consequence, if I{-5,s) and /(-<5,5)c denote the two parts of (2) corresponding to f_^ 
and /jj + //, then 

Ii-s,6r < -^ / (qsY-lMh + m)\du<2 (e-^(") qsf- max \ij{h + m)\ 

27r J(-5,SY «e(-7r,7r) 

for n big enough, since ipn converges uniformly towards ip on the compact set K = 
h + i[— 7r,7r]. Since qs < 1, for any S > fixed, /(_5^5)c e*"^*-^^ goes to faster than any 
negative power of t„, so 1(^6,6)" is negligible in the asymptotics. 

As for the other part, we can first replace ipn by ■?/' up to a (1 + 0{{tn)^^)), since the 
integral is taken on a compact subset of Sc- We then set u = . "' : 

where A{w) is the Taylor expansion 

, {h + iu) - m - ,\h) (i.) - 4^ iiuf = E ^ f ^i^= )" + o (^] 



(3) 



2d-1 

1. 1 v'w^ (A: +2)! \,/t::^m 




We also replace ip by its Taylor expansion 



k 



V'h-^,^)-E^(^,^)-^o 



Thus, if one sets 



lu; 



t k\ \^/t~^ml / I ^wtt (^+2)' Vv^ww, 



«fcH ■ of ^ 



then one can replace il)n{h + iw) e*"'^^"'-' by fn{w) in Equation (3), and moreover, each 
coefficient ak{w) writes as 

\k / \ fe+2 / \ k+2r 



12 
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with the afc,r(^)'s polynomials in the derivatives of ip and r] at point h. So, 



I^.,,,) =1 + 



For any power w'^' 



{tr. 



-tnF{x) 



'2v-\ 



S^/t„rj"{h) 



akiw e 



e 2 






(iw 



w" 



^= dw — I w 



e 2 



27r 



(iti; 



is smaller than any negative power of t„ as n goes to infinity: indeed, by integration 
by parts, one can expand the difference as e~^ ^"'^"^'^^^'^ Rm{\/Ui), where Rm is a rational 
fraction that depends on m, h, 6 and on the order of the expansion needed. Therefore, one 
can take the full integrals in the previous formula. On the other hand, the odd moments 
of the Gaussian distribution vanish. One concludes that 



P[X„ = tnX] 



and each integral J^a2ki 



-tnF{x) 




e 2 
a2k{w) —^dw 
\/2tt 




w] 



dw is equal to 



a2kAh) {2k - ly.l a2kAh){2k + 2r-l)\\ 

{r]"{h)f +■■■+ {r]"{h)f+^ 

where {2m — 1)!! is the double factorial (2m — l)(2?7i — 3) ■ ■ ■ 3 1, that is to say the 2r7i-th 
moment of the Gaussian distribution. This ends the proof of the first part of our Theorem, 
the second formula coming from the identities h = F'{x) and ri"{h) = p,},^-. ■ The second 
part is exactly the same, up to the factor 

-h 



1 



l-e" 



1-e 



-h—iu 



1-e- 



-h — 

g \/tnv"{h) 



in the integrals. 



D 



2.2. Central limit theorem at the scales o(t„) and o((t„)^/^). By making an expan- 
sion around //'(O) and modifying a little the arguments of the previous proof, one also gets 
the following central limit theorem: 

Proposition 2.8. Assume y = o((t„)"'^'®). Then, under the assumptions of Theorem 2.4, 

^[Xn > W(0) + VtnV"{0)y] = P[A/'(0, 1) >y]{l + o(l)) . 

More generally, if s = o{l), x = r]'{0) + s and h is the solution of ri'{h) = x, then 



^f^O,l)>h^/t„rf^^h^ (1 + 0(1)). 

Remark 2.9. In the case y = 0(1), which is the classical central limit, this follows imme- 
diately from the assumptions of Definition 1.1, since by a Taylor expansion around of 77 
the characteristic functions of the rescaled r.v. 

Xn - tnV(O) 



K 



vum 
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converge pointwise to e 2 , the characteristic function of the standard Gaussian distribu- 
tion. The improvement here is the weaker assumption y = o((t„)^/^). 

Lemma 2.10. There exists a constant D > only depending on rj, and an interval 
{—6,e), such that for all h G {—6,e) and all 6 small enough, 

qs = max \exp{r]{h + iu) — r]{h))\ < I — D 5'^. 

ue(-(5,<5)'= 

Proof. We have seen that exp{ri{h + iu) — ri{h)) can be interpreted as the characteristic 
function of X under the new probabihty measure dQ = jg^sxr dF. So, for any u, 

I exp{ri{h + iu) - i]{K))\^ = |EQ[e'"^]|' = ^ Q[X = n] Q[X = m] e^^^""™) 

= ^ j ^ Q[X = n] Q[X = m] j cos ku. 

fcsZ \n—m=k / 

Since Z is the minimal lattice for 0, there exists n, m with n — m = \ and P[X = n] 7^ 0, 
P[X = m] ^ 0. Then one also has Q[X = n] ^ 0, Q[X = m] ^ 0, and 

Q[X = n\ Q[X = m] > 15 £> > 

for h small enough. As cosm < 1 — ^ for all n G (— tt, tt), 

I exp(r7(/i + iw) - riih))"^ < 1 + 15D (cosm - 1) < l-3Du^; 

qs < Vl — 3 D S"^ < 1 — D 5^ for S small enough. 

We also refer to [Ess45, Theorem 6] for a general result on the Lebesgue measure of the 
set of points such that the characteristic function of a distribution is bigger in absolute 
value than 1 — 6'^. D 



Proof of Proposition 2.8. Notice that ^^"(O) 7^ since this is the variance of the law 0, 
which is non-zero because of the hypotheses put on its characteristic function. Set t^x = 
^n(^'(0) -|- s), and assume s = o(l). The analogue of Equation (2) reads in our setting 

g-i„F(x) rn ^tn{r){h+iu)-r){h)~iur)' {h)) 
P[X„ > tnX] = / 3^— iJn{h + iu) du. (4) 

Since h'{x) = F"[x) = ,h-. , h = -^A^ + O(s^). The same argument as in the proof of 

Theorem 2.4 shows that the integral over {—6, 6)'^ is bounded hj C 6 {qs)^", where qs < 1, 
and C 6 (with C a constant independent from s and 6) comes from the computation of 

■ip{h + iu) 



max 

«e(-(5,(5)': 



1-e 



-h—iu 



In the following we shall need to make 6 go to zero sufficiently fast, but with 5^^/tnrj"(0) 
still going to infinity. Thus, set 5 = (t,„)^^/^, so that in particular (tn)~^^^ ^ 5 ^ (tn)^^^'^- 
Notice that I{-c,c) e*"^'-'^'-* still goes to zero faster than any power of t„; indeed, 

D \ " , r,u M/5 



M'<{l-j^] <e-<' 
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The other part of (4) is 

2'K^Jtn'n"{h) J-s^tnn"{h) \ VtnV"{h)J 1-e'' v'^^^VVO 

up to a factor (1 + o(l)). Let us analyze each part of the integrah 



The difference between ijj { h-\ — , '"^ and ^(0) is bounded by 

max Mz)-^m = o{l) 

z&[-s,s]+i[-5,S] 

by continuity of ip, so one can replace the term with ip by the constant ip{0) = 1, 
up to factor (1 + o(l)). 

The term A{w) has for Taylor expansion 

/ \ 3 



6 \y^Uf{h)J \itn) 

so tnA{w) is bounded by a 0(t„5'^), which is a o(l) since 5 <ti (t„)^^'^. So again 
one can replace e*"^*^"'-' by the constant 1. 

• The Taylor expansion of j 1 — e Vtnv"(h) J jg \^ - (1 + o(l)). Hence, 

P [X„ > t„(V(0) + .)] = — — / ^— --^ ^ . dw]{l + 0(1)) 



-t„F(x)+ ^^'"''"''^> 



Indeed, setting /3 = h^Jtn^"{h), one has: 



A/'(0, 1) > /i A/i„r7"(/i)j (1 + 0(1)). 



_^ 1 /■ e " 1 / e' "^2 , 

e 2 — / — dw = — (b az 

27r jRf3 + iw 2m /r=/3+iiR z 



00/1 r , ^2 2 

e 2 az ] da 



2ivr ./r=a+iR 



°^e-^ 



rfa = P[Ar(0,l)>/3] 
Ztc 

by using a calculus of residue on the second line to differentiate with respect to /3. 
Hence, we have shown so far: 

P [Xn > tniv'iO) + s)] = e-*"^('''(°)+^) e^ F[^f{0, 1) > /3] (l + o(l)) . (5) 



Set y = sytjrfi^, and suppose that y = o((t„)^''^), or, equivalently, s = o{{tn)^^^'^) 
— this is stronger than the previous assumption s = o(l). Let us then see how everything 
is transformed. 
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By making a Taylor expansion around 77' (0) of the Legendre-Fenchel transform, 
we get 



F{x) = F{r]'{0)) + F'{r]'{0)) s + 



^^^s^ + Ois^) 



r 

2tr 



+ o{t 



n )i 



SO Q-tnF{r,'{0)+s) ^ g-iL_ 

On the other hand, 

s 



I5 = h^tni'{h) 



r/"(0) 



l + 0{s)) v/t„(V'(0) + O(s)) = y (l + 0(s)) = y {l + o{{tnV)) 



Consequently, f? = y^(l + o((t„)~^/^)) = y'^ + o(l), so e~ can be replaced safely 
by e 2 , which compensates the previous term. 

Finally, fix y, and denote Fy{\) = P[A/'(0, 1) > \y]. Then, for |A| say between ^ 
and 2, 



mx)\ 



e 2 



< max 

j/GR 



y _2d 

e 8 



^27r 



|P[Ar(0,l) > /3] -P[Ar(0,l) > y]| = |F,(l + o((t„)-i/^)) -i^,(l)| < 



C < +00; 



c 



oil). 



(t„)V6 

This ends the proof of Theorem 2.8, with the refinement that y can be as large as a 

0((tn)^/'). □ 



Remark 2.11. Theorems 2.4 and 2.8 generalize the usual central limit theorem and Baha- 
dur-Rao's estimates (see e.g. [DZ98, Theorem 3.7.4]) for sums of lattice-valued indepen- 
dent and identically distributed random variables. Indeed, one can consider that a sum 
Sn = Xi + - ■ ■+Xn of i.i.d. variables "converges mod-X with parameters tn = n and limit- 
ing function ■?/'(z) = 1", since E[e^'^"] = (E[e^'^])"'. Notice though that E[e^'^] might vanish 
and does not correspond to an infinitely divisible law, whence the quotes in the previous 
statement. However, one can follow exactly the same proofs to recover as a particular case 
the Bahadur-Rao estimates. The same remark will apply in the non-lattice case; for this 
case we refer also to [Ncy83, Ilt95] which give analogue results of "precise" large deviations 
(that is to say that one estimates directly P[X„ G tnB] instead of ^ logP[X„ G t„i?]). In 
a sense, the content of this paper is the missing box in the following diagram: 



Cramer's large deviations 
for sums of i.i.d. variables 



generalization 



precise estimates 



> Bahadur-Rao theorem 

generalization 



Ellis-Gartner theorem 

(conditionSOn ^log(^„(-)) precise estimates 



■> 



??? 



Notice however that the regime (order of renormalization) in which the precise estimates 
of probabilities will be given can be different from the usual regime of large deviations. In 
other words, we shall sometimes obtain the so-called moderate deviations of the sequence 
of random variables. For instance, for sums of i.i.d. random variables, the central limit 
theorem gives estimates in the regime 0{^/n), Cramer's theorem gives non-precise (loga- 
rithmic) estimates in the regime 0{n), and we will state results in the regime 0(n^/^), or 
even o{n^^^), see Theorem 3.2. 
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Example 2.12. Suppose that (X„)„gi^ is niod-Poisson convergent, that is to say that r]{z) 
e^ — 1. The expansion reads then as follows: 

with /i = logx. For instance, if X„ is the number of cycles of a random permutation in 
©n, then for x > 0, 

„-(a:loga:-a;+l) i 

P[X„ = b(logn)J] = -—(1 + 0(1)), 

V27rx logn 1 (a;) ^ 

and for x > 1, 

„-(xlogx-x+l) ™ 1 

Example 2.13. Equation (5) is the probabilistic counterpart of the number-theoretic re- 
sults of [Kub72, Rad09], see in particular Theorems 2.1 and 2.2 in [Rad09]. In §5.2, we 
shall explain how to recover the precise large deviation results of [Rad09] for arithmetic 
functions whose Dirichlet series can be studied with the Selberg-Delange method. 



3. Large deviations in the non-lattice case 

3.1. Berry-Esseen estimates and large deviations in the scale 0{tn). In this sec- 
tion we prove the analogues of Theorems 2.4 and 2.8 when is not lattice-distributed; 
hence, |e'''^'")| < 1 for any m 7^ 0. In this setting, there is a formula equivalent to the one 
given in Lemma 2.3, namely, 

1 / 1 /"^ g-x(h+m) \ 

P[X >d + -P[X = d = lim — / ^ ^x{h + m)du\ (6) 

2 R-^<x] \2'K J _^ h + lu J 

if y:>x{h) = E[e''"^] < +00 for h > 0. However, in order to manipulate this formula 
as in Section 2, one would need strong additional assumptions of integrability on the 
characteristic functions of the random variables X„. On the other hand, for non-lattice 
distributions, one would like to know not only the precise asymptotics of probabilities 
IP[Xn > tnx], but more generally of probabilities 

P[X„ G tnB] with B arbitrary Borelian subset of M. 

To this purpose, we should combine classical arguments of the theory of large deviations 
(see [DZ98]), and the following lemma which replaces Equation (6) and is the main tool 
for estimates of probabilities P[X„ > t„x]. 



Lemma 3.1 (Berry-Esseen expansion). Denote Fn{x) = P[X„ < tnr]'{0) + ^/tnr]"(0) x], 
and p{y) = (27r)~^/^e~^ /^ the density of a standard Gaussian variable. Under the as- 
sumptions of Definition 1.1, with cf) non-lattice, 




VUfW) 6v/tn(r/"(o))3 ) I Vv^ 

with the o(-) uniform on 
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Proof. We use the same arguments as in the proof of [Fcl71, Theorem XVI. 4.1], but 
adapted to the assumptions of Definition 1.1. Given an integrable function /, its Fourier 
transform is /*(C) = /rG"'^ /(^) d^- Consider a probabihty law F{x) = j_^ f{y) dy with 
vanishing expectation (/*)'(0) = 0; and G{x) = J g{y) dy a ?7i-Lipschitz function with 
g* continuously differentiable and 



lim G{y) = 

y—^—oo 



(/)'(0) = ; 
By [Fcl71, Lemma XVI. 3. 2], for any a; G M and any T > 

"^ r{0-9*{0 



lim G{y) = 1. 

3/— >+oo 



\F{x)-G{x)\ < 



TT 



T 



c 



dc 



24m 



Notice that this is true even when F does not admit a density / (see also §4.2.3). We 
shall apply this result to the functions 



F{x) = Fn{x) = cumulative distribution function of y„ 



Xn-W(O). 



G{x) = Gn{x) 



^'(0) 



y + 



V"(o) 



v/t;?^^'6vW(oF^^^-'^r^^^ 



The asymptotic behavior of /*(C) is given by the local uniform convergence ipn{z) -^ ip{z): 



E 






exp \tn\r] 
exp \tn\r] 
exp 



V(0) 



tnV"{0) 



a/WW. 

z 






V(0) 



+ \z\ o 



W(0), 

z 

tnV"iO) 

z 



X 1pn\ 



tnV"{0) 



<(0);z 
X I 1+ 7^ ^ +0, , 



V^n 



^'(0) z I z 



~ , . V-'IO) ^ r7'"(0) z3 

e^ 1 + ; ^ ' + — ^ ^ + 1 + \z\) o 



z 



ytn 



(7) 



Beware that in the previous expansions, the only o(-) that we manipulate is a 



f-n 



^ :£:('^ 1 with limeft) = 0. 



In particular, z might still go to infinity in this situation. To make everything clear we 
will continue to use the notation e{t) in the following. Up to the o(-), the last expression 
evaluated at z = i^ is the Fourier transform of Qn- Fix < 5 < A and take T = Ai/t^- 
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By Feller's lemma, 

-I /.AV*;; f*(n-n*(n 



\F4x) - G4x)\ < - / 



m)-9*nio 



c 



24m 



Anx/U 



1 f^"^ c^ . ,>,9^ f C \ ,. 24m 



Vr\/t^ ./„A./TT \\/UiJ ATTA/t 



n J ~&JUi \\/ ^n 



+ -A^ / i/:(c)-^:(c)Mc 

In the right-hand side, the first part is an %i, and the second part is smaller than J^r— 
for some constant M. Finally, the last integral goes to zero faster than any power of t„. 
Indeed, for |C| G \b ^ftn, A^ft^, 

' <K{A)\ max |exp(r/(iM))| I <K{A){q^ 

\y<(0)|«|G[5,A] 



i/:(c)i 



^n 



with qs < 1, and the constant K{A) determined by the uniform convergence of ipn to ip 
and by the behavior of ijj on the complex segment [—iA/^yr)"{0),iA/^yr|"{0)]. The same 
argument works for |5'i^(C)|; in the following the symbols ~ indicate constants that take 
care simultaneously of /*(C) and QniO- Fix e > 0, then S such that e{6) < e and M6 < e. 
Take A = |; we get 



for t„ big enough. This ends the proof of the lemma. D 

Theorem 3.2. Suppose non-lattice. If x is in the range of 7]',,^^, then 

p[x„ > t^x] = fl~';^l;,\^ Hh) (1 + o(i)) 

h ^^2Trtnri"{h) 
where as usual h is defined by the implicit equation rj'{h{x)) = x. 

Remark 3.3. Our theorem should be compared with [Hwa96, Theorem 1], which studies 
another regime of large deviations in the mod-0 setting, namely, when h goes to zero (or 
equivalent ly, x — )• ^^'(O)). 

Remark 3.4. The main difference between Theorems 2.4 and 3.2 is the replacement of the 
factor ilj{h)/{l — e~^) by ipijij/h] the same happens with Bahadur-Rao's estimates when 
going from lattice distributions to non-lattice distributions. 

Remark 3.5. In order to ease their uses and generalizations to a multidimensional setting, 
we have stated Theorems 3.1 and 3.2 with a single term in the expansions, but again 
there is a recipe to push these expansions further. Recall that the Hermite polynomials 
are defined by 

.2 (9'" / :.2- 

The first Hermite polynomials are Hq{x) = 1, Hi{x) = x, H2{x) = x^ — 1, H^lx) = x^ — 3x, 
etc. An important property of Hermite polynomials is that 

{p{x)Hr{x)no = {KYe-'^; (8) 
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it can be seen from the generating series of Herniite polynomials Yl'^o Hr{x) ^ = e*^~^ /^. 
On the other hand, with z = i(, the quantity (7) appearing in the proof of Lemma 3.1 
corresponds to the first terms of 

where /„ is the same function as in Theorem 2.4, with h = 0. Each coefficient 0:^(0 
is a polynomial in (i^), and we denote Ak{x) the image of afc(C) by the linear map 
{i(Y H-). Hr{x). Then, by using Equation (8) and adapting the proof, one sees that a 
better approximation of Fn{x) is 

2v 



LiLm'^HHi^) 



if one assumes that the speed of convergence is a o{{tn) ^)- This allows in Theorem 3.2 
to obtain an expansion of 

P[X„ > tnx] exp(t„F(x)) {h ^2ntnV"{h)) 

in powers of iZ-y/t^, though one also needs for these computations the precise asymptotics 
of all the integrals J^ y^ g-(s/+a) /2 ^y -^jth k fixed and a going to infinity. 



Proof of Theorem 3.2. The main trick, well known in the theory of large deviations, is to 
make the exponential change of probability 



Qhy 



^xAh) 



nX^Edy]; 



we denote X^ a random variable following this law. The Y^s have for generating functions 
(fj^ (z) = (px„{z + h)/ipx„{h), so they converge m.od-(f)', where cp' is the infinitely divisible 
distribution with characteristic function e^*^'*"'"^^"''*^'*^. The limiting function in the niod- 
(j)' convergence is also replaced by ^(z + h)/ilj{h), but otherwise all the assumptions of 

Lemma 3.1 are satisfied. So, the distribution function Fniu) of "," "^ ^ - is 

GM= r |1 + ^\ ^^^ y+ /"^^^^ {y'-5y)]p{y)dy 

up to a uniform o{l/ ^/t^^). Then, 



P[X„ > Ux] = / ^xMe-'yQidy) = vx„{h) / e-H^-^'W+v/^^^n^-JdF^ln) 

■J y=tnX J U=0 

fOO 

'u=0 



^„(/i)e-*"^(") [ e-Vi-^C^)"^^™ 
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To compute the integral /, we choose the primitive of dFn{u) that vanishes at m = and 
we make an integration by parts: 



hVw^« (ir^(^) _ ir„(o)) + h^/t~^f{h) / e-'^Vw^" ^FM - Fnm du 



n=0 



M=0 



hVtJfih) r e-^V^^^^f^)- (gM - G„(0) + o (^ 

Ju=0 V V V ^n 



du 



h^/tnV"{h) II e 

'o<y<u 



h^tnri"(h)u 



1 + 



i,'{h)y 



^m^^Uy)dydu 



^{h) ,/Ufih) Vtniv"ih)y 



r-OO 

'y=o 






where on the two last lines the symbol ~ means that the remainder is a o((tn) ^'^). Also 
by integration by parts, one can compute recursively the estimates 



e —dy = 1-^ + 0^ 



ye 2 dy = U\ 

a-" 



ye 2 ^^=^ l + OK 



(y+n) 



y* e 2 dy = 0\ 



e 2 



for a going to infinity, and this shows that the only contribution in the integral that is 
not a o((t„)~-'^/^) is 

1 



^-hy/Uf{h)yp(^y) dy = ^^^^ 

y=o h^2'ntnri"{h) 

This ends the proof since ipn{h) — > ipih) locally uniformly. 



D 



3.2. Precise moderate deviations for random variables with control on cumu- 
lants. Let us come back to the example of centered random variables Sn having cumulants 
K^'^\Sn) bounded in absolute value by (CrY an {(3nY, where (Q;„,)neN and (/3n)neN are two 
arbitrary sequences, a„ going to infinity — in the example of the introduction 1.7, we 
had an = n"" and /3„ = ra'', which is the most common case. We also suppose that 

k(2)(5„) = a2a„(/3„)2(l + o(K)''/'')) and that K^^\Sn) = Lan{l3nf {'^ + o{{an)-^h)- 
Then, one observes the following asymptotic regimes: 



Sn>Snl^n{an)^\ =-(l + o(l)). 



1 



(1) For any sequence s„ = o(l) 

P 

(2) More generally, for any sequence s„, = 0(1), 



P 



Sn > SnPn K)^! = P[A/'(0, a^) > S„]{l + o(l)). 



This follows immediately from Lemma 3.1, and one even knows that the remainder 
is in fact a 0((a;„)~^^^). 
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(3) Actually, the previous expansion holds for s„ = o((a„)^/^), see [Hwa96], and our 
Proposition 2.8 for the analog in the lattice case. So, for any such sequence going 
to infinity. 



P 



expi-%f 



^n > S„ /?„(«„) 5 = ^ y (l + 0(l)). 



(4) At the next level, the function ip(x) = exp(^) comes into play, so for any sequence 
Sn = 0(1) bounded from both sides by positive constants. 



P 



2 

Sn > s„/3„(a„)3 



exp(-K)5^) 

^ ^^(.„)(l + o(l)). 



S„(a„)6 y/2'KCr'^ 



Indeed, we can apply Theorem 3.2 to the sequence 5'n/(/3n (a^)^^'^), which con- 
verges mod-Gaussian with speed (a„)^'^cr^ and limiting function -0. 

One may then ask up to which order the previous expansion can be pushed. Up to 
a renormalization of the random variables, one can suppose a^ = 1. Set then X„ = 
Sn/{(3n {oinY^^)- Noticc that as long as z„ = o{{anY^^'^), the estimate 

^x„(.n)=exp(K)^^ + ^) (1 + 0(1)) 

holds, because the terms of order r > 4 in the series expansion of log (fx„ all go to zero 
— this is the same computation as after Definition 1.1, but noticing that the 0((a„)^^/^) 
in the estimate of the remainder is in fact a 0({znY (a„)^^/^), that is, a o(l) under the 
previous assumptions. This leads us to try to push the expansion up to x = x„ (a„)^/^^, 
with Xn = o{l). In the following we also suppose x — )■ cxd since the other situation is 
already known. If one makes the change of probability F[Yn G dy] = ° ,-. P[X^ G dy], 
then the generating function of Yn writes 



VyAz) = ^^— = exp > {{x + z) -x) 

\ogipY„iz) = ^^ (^z^ + 2zxn (a„)A) (^1 + o((a„)-A 



+ - (^z^ + 3z^Xn{an)'^ +32;(x„)2(a„)ij (^l + o(^(a„) i 

+ j2'^^iix+zy-xn. (9) 

r=4 

According to the previous discussion, the sum of terms with r > 4 goes to zero. So, locally 
uniformly, 

1 , X I , xA , L(a„)s(x„)M Ma„)5 +L(a„)T2 x„\ ^ L ^ 
log<^r„(^)= I (an) '^Xn+ '^' M z+ I ^^ ^^ 1 z' + -z'' + o{l), 

and if Zn = Yn — (an) 12 x„ 2 ' then the Z„'s converge in the mod-Gaussian sense 

with parameter (a„)3 + L(a„)i2 x„ and limiting function exp(^^). One can therefore 
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apply Lemma 3.1, and 



P 



P 






Xn > (a„)^ X = <^x„(x) / e-^y F[Yn G dy] 

y={a„)''-/^x 



^xAx)e 



2 



e"^" F[Zn G du] 



where Rn is given by 
Rn = e 2 



- {x„ (a„)i/4 ^l+L(a„)-i/i2a;„)^ 



£(3:n)^ 



Fn{dw). 



This integral -R„ is now computed exactly as before, and we obtain in the end the following 
result, to be compared to Edgeworth's expansion in the central limit theorem, see for 
instance [Fel71, XVI. 2 Theorem 2 and XVI. 4 Theorem 3], and also the results of [DE12]. 



Corollary 3.6. Let {Sn)ne'N be a sequence of centered real-valued random variables such 
that \K^^\Sn)\ < {Cry an i(3nY with a^ — > cxd and 

K^'\Sn) = a' «„ (/3„)2 (1 + o((a„)-^/i2)) ; k^^XS^) =Lan {Pnf (1 + o((a.)-^/«)). 
//(««)'/'< r<K)3/^ then 



P 



On 
Jn 



>T 



2tt(j 
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exp 



LT^ 



6a3 



ar. 



(1+0(1)), 



the second term in the expansion being meaningful for (anY^'^ ^ T ^ 



Or 



N3/4 



Corollary 3.6 hints at a possible expansion up to any order T = o((a„)^~^), and indeed, 
it is a particular case of the results given by Rudzkis, Saulis and Statulevicius in [RSS78, 
SS91], see in particular [SS91, Lemma 2.3]. Suppose that 

l^^^^'KSn)] < {CryanWnY ; K^'\Sn) = Kir)anWnni + Oiian)-')) 
the second estimate holding for any r < p; we denote o"^ = K{2). In this setting, one can 
push the expansion up to order o((a„)^~^/^). Indeed, define recursively for a sequence of 
cumulants (/€*-'"-')r>2 the coefficients of the Petrov-Cramer series A^^'-* = —br-i/r, with 

For instance, A*^^-* = — ^, A*-'^-' = ^^, A^^-* = - — ^24 — ^5 ^^^- The appearance of these 
coefficients can be guessed by trying to push the previous technique to higher order; in 
particular, the simple form of A'''^-' is related to the fact that the only term in z^ in the 
expansion (9) is ^^. If for the k^'^^^s one has estimates of order (a„)^~'"''^(l + 0((a;„)~^)), 
then one has the same estimates for the A'^^'^'s, so there exists coefficients L(r) such that 



A(0 



On 



L{r){anf-''\l + 0{{an)-')). 
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p-i 



Take then T = x (an) ^ with x = 0(1), but T (a;„) ^^'^ going to infinity; Lemma 2.3 of 
[SS91] ensures that 



P 



On 



>T 



27ra2 



r2 



exp 



.r=3 



r) 



T 



{any/' 



a 




(l + o(l)) 



exp > — ; \ -, (1 + o 1 ) 



a' a. 



Unfortunately, in this estimate, one cannot go up to T = 0(a„), because the generating 
series of the L(r)'s does not make sense. Indeed, unless one starts already with Gaussian 
variables (that is to say that K{r) = for all r > 3), this series has radius of convergence 
zero, since otherwise, by the classical (non-precise) large deviations, one would have nec- 
essarily Yl'^^L^r) (-) = 0, whence a contradiction. Nevertheless, the discussion shows 
that the method of cumulants of Rudzkis, Saulis and Statulevicius can be thought of as a 
particular case (and refinement in this setting) of the notion of mod-0 convergence. Also, 
it shows that one cannot expect precise large deviations in the regime T = 0(a„) (or 
even o{an)) by a cumulant method without an adequate "renormalization theory" of the 
Cramer-Petrov series. 



These estimates will be used extensively in the last sections of the paper. To conclude 
this paragraph, let us mention that in the same setting, one can state results similar to 
the law of the iterated logarithm. Let (S'„)„eN be a sequence of random variables that 
satisfy the hypotheses of Corollary 3.6 with a„ = n and /3„ = 1; the second condition can 
be ensured by looking at |^ instead of S'„, and the first condition is quite common in the 
setting of sums of random variables whose graph of dependence is of bounded degree, see 
the examples starting from Section 7. We suppose that the S^s are defined on the same 
probability space, and we look at sequences 7^ such that almost surely, 

limsup — < 1. 

n— >-oo "Jn 

The main difference between the following Proposition and the usual law of the iterated 
logarithm is that one does not make any assumption of independence. Such assumptions 
are common in this setting, or at least some conditional independence (for instance, a law 
of the iterated logarithm can be stated for martingales); see the survey [Bin86] or [Pet 75, 
Chapter X]. We obtain a less precise result, but which does not depend at all on the way 
one realizes the random variables S'„. In other words, for every possible coupling of the 
SnS, the following is true: 

Proposition 3.7. Under the previous assumptions, 



lim sup 



^/2o^^nAog 



< 1 almost surely. 



n 



Proof. Notice the term log n instead of log log n for the usual law of iterated logarithm. 
One uses of course Borel-Cantelli lemma, and compute 



F[Sn > ^/2{l + e)a^n\og 



-(l+e) logn 



n\ 



v/47ror4(l + e) log 



= < 

n n 



l+e 



for n big enough. 
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For any e > 0, this is sumniable, so eventually Sn < a/2(1 + e)a'^n logn. Since this is 
true for every e, 

lim sup — = < 1 . □ 



n^oo \/2a'^ n log 



n 



3.3. A refinement of the Ellis-Gartner theorem. Theorem 3.2 is also the first step 
of a refinement of the Ellis-Gartner theorem (see e.g. [DZ98, Theorem 2.3.6]) with precise 
estimates of the probabilities P[X„ G t„i?]. Until the end of this section, we make the 
following assumptions: 

(1) The random variables X„ satisfy the hypotheses of Definition 1.1 with c = +oo 
(in particular, tp is entire on C). 

(2) The Legendre-Fenchel transform F is essentially smooth, that is to say that it takes 
finite values on a non-empty closed interval Ip and that limF'(x) = lim/i(x) = 
±oo when x goes to a bound of the interval Ip {cf. [DZ98, Definition 2.3.5]). 

The later point is verified if is a Gaussian or Poisson law, which are the most important 
examples. 

Lemma 3.8. Let C he a closed subset ofM.. Either iniuec F{u) = +oo, or inf„gc'F(-u) = 
m is attained and {x G C | F{x) = min„gc'F(-u)} consists of one or two real numbers 
a < b, with a < "^'(0) <b if a ^ b. 




Figure 2. The infimum of F on an admissible closed set C is attained 
either at a = sup(C fl (— oo,r7'(0)]), or at 6 = inf(C fl [?7'(0), +oo)), or at 
bothif F(a) =F{b). 



Proof. Recall that F is strictly convex, since its second derivative is l/ri"{h), which is the 
inverse of the variance of a non-constant random variable. Also, 77' (0) is the point where 
F attains its global minimum, and it is the expectation of the law 0. li C (1 Ip = 0, 
then F\c = +C)0 and we are in the first situation. Otherwise, F\c is finite at some points, 
so there exists M G M+ such that C fl {s G M | F{x) < M} ^ 0. However, the set 
{x G M I F{x) < M} is compact by the hypothesis of essential smoothness: it is closed 
as the reciprocal image of an interval ] — 00, M] by a lower semi-continuous function, and 



bounded since lim^ 



>(ipr 



\F'(x) 



-00. So, C n {x G M I F(x) < M} is a non-empty 
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compact set, and the lower semi-continuous F attains its minimum on it, which is also 
muiu^c F {u) . Then, if a < 6 are two points in C such that F{a) = F{b) = min„gc'F(M), 
then by strict convexity of F, F{x) < F{a) for all x G (a, 6), hence, (a, 6) C C^. Also, 
F{x) > F{a) if X ^ [a, b], so either a = 6, or 77' (0) G (a, b). D 



We take the usual notations B° and B for the interior and the closure of a subset 
B gM.. Call admissible a (Borelian) subset B cM. such that F\b 7^ +00, and denote then 
F{B) = iniueB ^'{u) = min^g;gF(M), and -Bmin = {a E B \ F{a) = F{B)}; according to 
the previous discussion, -Bmm consists of one or two elements. 

Theorem 3.9. Let B be a Borelian subset ofR. 
(1) If B is admissible, then 




(l_c-|h(«)l)^r?"{/i{a)) 



limsup (V2vrt„ exp(t„F(S)) P[X„ G t„S]) < 

'in |fc(a)| v/^"{/i{a)) 

i/ie distinction of cases corresponding to lattice or non-lattice distributed. The 
sum on the right-hand side consists in one or two terms — it is considered infinite 
ifa = r]'{0) e Brain- 

(2) If B is not admissible, then for any positive real number M, 

lim (exp(t,M)P[X„ G t^B]) = 0. 

n—^oo 

Proof. For the second part, one knows that ipn{x) exp(— t„r7(x)) converges to iIj{x) which 
does not vanish on the real line, so by taking the logarithms, 

lim = ri[x). 

ri->oo tn 

Then, Ellis-Gartner theorem holds since F is supposed essentially smooth. So, 

hmsup < —F[B), 

and if B is not admissible, then the right-hand side is —00 and (2) follows immediately. 

For the first part, suppose for instance (f) non-lattice distributed. Take C a closed 
admissible subset, and assume 77' (0) ^ C — otherwise the upper bound in (1) is +00 
and the inequality is trivially satisfied. Since C^ is an open set, there is an open interval 
(a, 6) C C^ containing ^^'(O), and which we can suppose maximal. Then a and b are in 
C as soon as they are finite, and C C (—00, a] U [6, +00). Moreover, by strict convexity 
of F, the minimal value F{C) is necessarily attained at a or b. Suppose for instance 
F{a) = F{b) = F{C) — the other situations are entirely similar. Then, 

P[X„ G tnC] < P[X„ < t„a] + P[X„ > tnb] 

< exp(-*„F(C)) ( tjm^ + pm\ (1 + „(i)) 

" \-h(a)^2wt„rf{h{a)) h(b)^27rt„rf{h(b)) J ^ " 

by using Theorem 3.2 for P[X„ > t„6], and also for P[X„ < t„a] = P[— X„ > —tno] — 
the random variables — X„ satisfy the same hypotheses as the X„'s with 7]{x) replaced by 
ri{—x), iplx) replaced by ■?/'(— x), etc. This proves the upper bound when B is closed, and 
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since F{B) = F{B) by lower semi-continuity of B and i?min = (-B)r 
immediately to arbitrary admissible Borelian subsets. 



the result extends 

D 



One can then ask for an asymptotic lower bound on P[X„ G tnB], and in view of the 
classical theory of large deviations, this lower bound should be related to open sets and to 
the exponent F{B"). Unfortunately, the result takes a less interesting form than Theorem 
3.9. If S is a Borelian subset of M, denote B^ the union of the open intervals (x,x + k) 
of width n > 6 that are included into B. The interior O = B° is a disjoint union of a 
countable collection of open intervals, and also the increasing union lJ5>o ^^■ 




Figure 3. In some problematic situations, one is only able to prove a 
non-precise lower bound for large deviations. 



However, the topology of S° may be quite intricate in comparison to the one of the 
B^^s, as some points can be points of accumulation of open intervals included in B and 
of width going to zero (see Figure 3). This phenomenon prevents us to state a precise 
lower bound when one of this point of accumulation is a = sup(-B° fl {—oo,ri'{0)]) or 
b = inf(i?° n [//'(O), +oo)). Nonetheless, the following is true: 



Theorem 3.10. For an admissible Borelian set B, 



HHa)) 



liminf liminf (V2Trt^ exp(t„F(5^)) P[X„ G t^B]) > 

(5— >-0 n— s>oo 






/ae(B°)min \hia)\y/ri"{h{a))' 

with again the distinction of cases lattice /non-lattice. In particular, the right-hand side in 
Theorem 3.9 is the limit of ^fM:~n exp(tnF(5)) P[X„ G tnB\ as soon as F{B^) = F{B) 
for some 5 > 0. 

Proof. Again we deal with the non-lattice case, and we suppose for instance that (-B°)min 
consists of one point b = mi[B° n [//'(O), +oo)), the other situations being entirely similar. 
As 6 goes to 0, B^ increases towards B° = IJ5>o ^^^ ^*-* ^^^ infimum F{B^) decreases and 
the quantity 

L{S) = liminf (\/2^ exp(t„F(S'^)) P[X„ G tnB]) 

is decreasing in 6. Actually, if b^ = inf(i?^ fl [rj' {0),+oo)), then for 6 small enough 
F{B^) = F{b^), so lims^o F{B^) = F{B°) by continuity of F. On the other hand, 

^lj{h{b^)) 



R{5) 



h{¥)^r]"{h{¥)) 
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goes to the same quantity with b instead of b^. Hence, it suffices to show that for S 
small enough, L{S) > R{6). However, by definition of B^, the open interval {b^, b^ + S) is 
included into B, so 

P[X„ G t^B] > P[X„ G tnB'] > P[X„ > tnb'] - P[X„ > tnib' + 5)] 

^ ^{h{b'))e-'-^('') (1 I 0(1)) ^W^ + ^))e-^"^(^^+^) (1 I 0(1)) 

- /i(6^) v/27rt„r/"(/i(6^)) ^ ' ^^ 

since the second term on the second line is negligible in comparison to the ffist term — 
F{b^ + 6)> F{b^). This ends the proof. D 



4. MULTI-DIMENSIONAL EXTENSIONS 

In the following, vectors in M"' are denoted by bold letters V, W, . . ., and their coordi- 
nates are indexed by exponents, so V = (V-^^ . . . , V^'^^). This convention enables us to 
manipulate sequences (V„)„gN of (random) vectors in M'^ without ambiguity. 



4.1. Sum of a Gaussian noise and an independent random variable. The multi- 
dimensional extension of the previous results is geometrically non-trivial, and we shall 
restrict ourselves to the following situation, which is the mod-Gaussian setting: 

(a) A sequence of M'^-valued random variables (X„,)„gN is said to converge in the mod- 
Gaussian sense with parameters An G GL(]R°') and limiting function tp if locally 
uniformly on C^, 

with ijj analytic in d variables and non- vanishing on its real domain, and Z'„ = {An)~^ 
going to zero. 

(b) By replacing A^ by ^J A^A*^^ it does not cost anything in our definition to suppose 
that An is symmetric definite positive. 

(c) Our goal is then to find the precise asymptotics of the probabilities P[X„ G (y4„)^i?], 
where i? is a Borelian subset of M'^. In the one- dimensional case. An = \ft^ and we 
can rewrite the estimate of Theorem 3.9 as 



limsup ( W^ {tnb) exp( ^ ) P[X„ G t„5] ) < / ^(x) N{dx) 



where b = inf{|a;| , x G B}, and A^ is the counting measure on -Bmim which consists of 
one or two points. Moreover this estimate is sharp as soon as B contains small open 
intervals [b, b + 6) or (—6 — 6, —b] (according to the type of -Bmin)- 

Example 4.1. In order to make a correct conjecture, let us consider a toy model. We 
suppose here that X„ = Y+y4„G, where G is a d-dimensional standard Gaussian variable. 
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the AnS are symmetric and positive definite, and Y is a fixed random variable independent 
of tlie Gaussian G and witli generating series ip{z). A simple computation yields 



E 






4 = 1 



SO we are typically in the mod-Gaussian setting. We assume in the following that: 

(d) The random variable Y has a density z^(y) with respect to the Lebesgue measure, and 
is a bounded random variable. In particular, il){h) < e'""'^'^" for some constant C. 

The probabilities that we want to estimate write then as: 

P[X„ G {A,,fB] = -—— / / e-^ l(y+^„,e(^„)2B) z/(y) d^dy 

detAn f f _ u*(A„)2u / N , , 

= 7^^Z^dj2 / e ' Ms„ry+ueB)Hy)dndy 

Since Z'„ goes to zero, the term in parentheses goes by Lebesgue dominated convergence 
theorem to J^^ e(^\y) z/(y) dy = '?/'(b). To simplify a bit the discussion, we suppose: 

(e) For some fixed symmetric positive definite matrix A, An = y/t^A with t„ increasing 
to infinity. 

Then, in particular, the term in parentheses goes to ip{h) in an increasing way, and the 
convergence is locally uniform in b. One can even suppose A = Id, the general situation 
following by obvious changes of coordinates. We can then perform a Laplace method on 
the previous expression, but in a quite unusual multi-dimensional setting. Set 



b = mi{^/{h\ h) \he B}, 

and suppose 6 7^ 0. For any e > 0, we split B into i?<5+£, the set of points of B that are of 
norm ||6|| smaller than b + e, and B>h+e- We also denote -Bmin = {x G -B | a/(x | x) = b}. 

• The integral over -B>6+£ is smaller than the integral on the whole complementary 
of the ball of radius b + e and center the origin. Recall that the measure of the 
{d — l)-dimensional regular sphere is vol(S'^~^) = 2 ffdM- ^*-'' ^^ ^>b+e denotes the 
part of P[X„ G t„i?] corresponding to the integral over i?>f,+£, then 

T ^ I ^ri\ I - '"ll^ll^ (711x11 r ^ 2 I tn\ I d-1 - *""^ Cw i 



b+e 



2e^ fuy^' r ( c^''-' 



< ^, , , , — / \ X -\ e '^ dx 

-T{d/2)\2j A+,_^V tj 

En 

-r(d/2)\2J V t..i>J Jt+c^f 

1 I tn\ I C \ I d — 2\ _ tn{b+E) 



^fm[i) l^ + u;j I'-o-i e-^e--,.+. 



C(fe+£) /r , xd-2 
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assuming 7^ < e, and using integration by parts at the end to estimate the Gauss- 



ian integral. To fix the ideas, suppose that t„6 > C and t„6^ > 2{d — 2), which is 
surely the case for n big enough. Then, 

/>.+. < ^ (2 t^f' e-'-^ e^(^+^) ih + eY-\ (10) 

As for the integral over B^b^^, we have to use more precisely the topology of B 
in the neighborhood of the sphere of radius b. Recall that for any non-negative 
measurable function over M.'^, one can make the polar change of coordinates 

f{^)d^= I f{r^) rf/i§d-i (x) ) r'^-^ dr, 

Jr=0 \Jsd-^ / 

where figd-i is up to a scalar the unique SO(]R'^)-invariant measure on the sphere 
— the "spherical" Lebesgue measure. Consequently, we can write 

d/2 



(l + o(l))/<,+.= f;^') / e-'^ls{h)^P{h)dh 



t \'''^ t .2 



271"/ -'||c|l<6+e 



e 2 / e V ^ / 1b(c)'?/'(c) (ic 

yiicii<6 

d./2 , /.b+e 



(2^) """"^ / ^' ^^"^"^^ ^'"' (A , 1^(^^) ^(^^) ^'"s^-(c)) ^^, 



the multiplicative factor (l + o(l)) corresponding to the replacement of the integral 

J jjrf e^'^'^^ e 2^ ^(y) c^y by '?/'(b). Suppose for a moment that the integral in 
parentheses is continuous with respect to r, or at least continuous at r = 6. Then, 
up to a multiplicative factor (1 + o(l)), we can replace it by the constant 

^^ / ^(b)d/i§d-i(o,6)(b) 

where /i§d-i(o,6) = /^surface is the spherical measure on the sphere of radius b that 
gives for total weight fe'^^^ vol(§'^^^). Hence, 

/<6+.= (^) '^^(X V^(b)rf/isurface(b)) (^^'^%*"('^)r^-irfrj (1 + 0(1)). 

(11) 

Set e = (t„)~^/'^. In (10), the expansion of e~^^^ gives e~^^ Q-bit„) ^i _|_ o{l)), so 
/....< ^^(^(2t.)'^/^e"^e-(*")-(l + 0(1)). 
On the other hand, since t„£ = {tnY^^ and t„e^ = (t„)~^/^, in Formula (11), one obtains 
/<6+. = (^y ' e-"^ (^ V^(b) rf/i,,rf,,e(b)') n^ y-^ e-'-"^ ds^ (1 + 0(1)) 



(t^ 

\ d/2 



d/2-1 h2 / f \ / /'+°° \ 

/ V^(b) t//isurface(b) / e-"* rft (1 + 0(1)) 



2tx ) 



^-"^[l /l^rf/^surface(b)Vl+0(l)). 

\^ -Dmm "-III! / 
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The first term is negligible in comparison to the second term, so, one can reasonably 
conjecture that 

limSUp I ( ^ ) {tnb) exp ( -^ J IP[X„ G t^B] j < / V^(x) /isurface(c?x). 

The reason why one has only a inequality with a limsup and not a equality with the 
limit is that the spherical integral with B a closed set (which does not cost anything since 
Buiin = {B)uiin) is only upper semi-continuous, see the following Lemma. Therefore, 

limsup (/ lB(rc)V^(rc)c//igd-i(c) ) < / lij(6c) ^(6c) o?/^§d-i(c), 

and then the discussion is the same but with inequalities instead of equalities in the 
estimates of I<b+e- 

Lemma 4.2. Let ip be a non-negative continuous function, and C a closed subset ofW''. 
The function 

Fc -.r^ I V^(rx) lc(rx) rf/i§d-i(x) 

is upper semi- continuous on (0,+oo). 

Proof. If instead of Ic we had a continuous function, then by standard results on integrals 
of continuous functions in two parameters, the integral F would be continuous in r. How- 
ever, by using the distance function to C, one can write \c as the infimum of continuous 
functions, and therefore, Fc is itself the infimum of continuous functions, whence upper 
semi-continuous. D 




Figure 4. In the multi-dimensional mod-Gaussian setting, the precise up- 
per bound is given by the surface integral of %p on -Bmin, the set of points of 
B that also lie on the sphere of minimal radius h. 



Now working in the more general case of an arbitrary symmetric matrix An = \/t^A, 
we get the following upper bound: 

limsup I r^j {tj)) exp r-^\ P[X„ G [A^fB] J < j ^(x) /isurfacc(rfx) (12) 
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where b = mi{^y{Ah \ Ah) \ b G B}; -Bmin = {x G i? | a/(v4x | Ax) = 6}; and /isurface is 
the surface measure on the A-sphere of radius b obtained by the hnear automorphism A 
from the surface measure on §'^^^(0,6) — in particular it has the same mass, so beware 
that unless A = I^, this is not the surface measure obtained by restricting the Riemannian 
structure of the euclidian space M.'^ to the A-sphere. The goal of this section is now to 
prove the estimate (12) assuming only hypotheses (a), (b) and (c); and to give a sufficient 
condition for the limsup to be a limit. 

4.2. Smoothing techniques and estimates for test functions and domains. In 

this section, we assume that 4.1. (a), 4.1.(b) and 4.1.(c) are fulfilled, and we set 

_ r(x„) _ ^ ^ 



where E = A ^ . We look for the analogue of Lemma 3.1 in a multi-dimensional setting. 
Though it might look awkward to work with multi-variables cumulative distribution func- 
tions F{x''^\ . . . , x'^''^) = P[Y.ft ' < x''^\ . . . , Yn < x^'^'^], it seems to be the only way to 
obtain sharp estimates of the probabilities of hypercubes ni=i['^*'*^ ^^*^] under the law /x„ 
of Y„. By sharp we mean with a remainder that is a o{l/^/t^), and this is needed for 
our multi-dimensional principle of large dimensions (see §4.3). Nevertheless, one can also 
give other more general (but less precise) estimates, e.g., 



[1) estimates of the difference between E[F(Yn)] and G[F] = J^^^ -^(x) gijx.) (h^ for F 

]_ 



in a suitable class of test functions, where g(yi) = j^-^^e "''" ^^ is the density of 



a standard rf-dimensional Gaussian variable. 

(2) and estimates of the difference between fin{B) and G{B) where B runs over a class 
of regular domains, for instance, the class of all convex bodies with a sufficiently 
regular boundary. 

Though not needed for our results of large deviations in several dimensions, these new 
estimates seem of interest in their own right. They should be compared with the approx- 
imation results of [BRIO], which are also proved in several dimensions, but only in the 
setting of sums of i.i.d. random variables. 

4.2.1. Estimates for test functions. A preliminary step consists in smoothing the distri- 
bution of the Y^'s, and to this purpose we shall use standard methods of convolution. 
Set 

^(fc) 1 fl-cosTx\ fsmTx\ _ [ ^^2 ) [-^^) 



^ ^ ' I{k) V Tx"^ ) \ TX ) r°° n-cosx \ /'sinx\2fc ^^' 

u — OO V X / \ X / 

and in dimension d, A^jjj(x) = Y[i=i^T i^'^''^)- One defines the convolution of A^^^^ 
with a test function F by 

{A^^'l * F)(x) = / F(x - y) A(.^!,,(y) dy. 

Since A^^^ is non- negative and of total mass J^^ Arp^a('y) dy = 1, if yU is a probability 

measure, then one can define a positive normalized linear form on the space ^^(I^'^) of 
continuous compactly supported functions by 

(A(.%*/.)(F)=MA^'i.*F), 
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and it corresponds by Riesz' representation theorem to a unique probability measure on 



,(fc) 



(fc) 



(k) 



the convolution /i^ '' = A^j^^ * /i of /i with the kernel A^ ^^ . This new measure has 
the following additional properties, which are proved in Appendix 9.1. 

Lemma 4.3 (Kernel with compactly supported and smooth Fourier transform). 

(1) Suppose F Lipschitz with constant m (w.r.t. the euclidian norm on W^) and 
hounded in absolute value by C . Then, 



.Wi 



K^(F)-Mi^)l<yrf^f^2fcTr 



(fc) 



(2) The Fourier transform of Arp^a takes its values in [0,1], is supported by the hy- 
percube [-{2k + 1)T, {2k + VjT^ and is of class ^^^ on R'^. 



By combining these properties and the hypotheses of mod-convergence, we get: 

Lemma 4.4 (Berry-Esseen estimates for test functions). Let F be a function which is at 
the same time integrable, bounded and Lipschitz; C = ||-F||oo and M a Lipschitz bound for 
the fluctuations of F. For any k > \^^~\, there exists a constant K^ such that 

\E[F{Yn)]-G{F))\ < -^ (27{2k + 1) Vd{CM^''+')^^{dtn)^^ + CK^ 



O 



M,C,k 



{tr 



\ 2 4fc+4 



and Kk depends only on k, on the speed of convergence of the renormalized characteristic 
functions on the compact set Z'([— i,i]'^), and on the behavior of ip on this compact set. 



Proof. Fix k > \^] ■ If yUn is the law of Y„, then 



(k) 



\UF)-G{F)\ < \^^4F)-^^l'>{F)\ + \^^^An'^-G'T\n + \G'T'iF)-G{F)\ = a + ^+^ 



W-cL^)f 



,(fc). 



For a and 7, a correct bound is given by the property (1) of the previous lemma, 
and we cannot hope for much better since /i„ might be a singular measure {e.g. 
without density). Hence, 

« + 7<27rf-+4 (^^^^ 



For /3, we use Parseval's theorem. Suppose first that F is in L^(]R'', (ix); then. 



.{k) 



ik)i 



f^l:MF)-G'T'iF)\ 



FM 



dfJ'n 



(k) 



dG 



(k), 



dx 



(ix 



(ix 



{2ny 



(k) 



F{y)A};>{y) /i;(y) - G(y) rfy 



The identity still holds for F e L\R'^,d:si), because Li(M'^,dx) nL'^{R'^,d^) is 
dense in L^(]R"', dx). Denote ^„(z) the left-hand side of the Equation appearing in 
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§4.1. (a). With T = Q^ and y G [-(2A; + l)r, {2k + 1)T]'^, one has 

/^(y) -G(y) = (^„(ir„y) - l)e-^ = / (V^;(tir„y) | ir„y) e"^ rft 

Jo 

= -= / (i:<(tiZ'„y) I iy) e-^ dt, 

yt'n Jo 

and therefore, 



l^'n^AF) - GPiF)\ < J^T^TTr f I ^(y)^(y) (^^^(^i^ny) I ly) e-^ dy 
<^ [ [ i^(x)G„,i(x)rfx 

V'^n Jo JW 



dt 



dt< ^^ f I \GnA^)\d^dt 

V^n Jo 



where Gn,t is the inverse Fourier transform of A^^^(y) (Z'-?/'^(tiZ'„y) | iy) e 



T. 

that is to say that 



G„,(x)= / A5.'i.(y) (i:<(tir„y)|iy)e-^-<'^l^>c/y. 



However, 

(1) A^^l, is of class ^^^+1 on R'^ and vanishes outside [-{2k + 1)T, (2A; + l)r]^; 

(2) by the hypothesis of mod-Gaussian convergence, ip'^tiEny) and all its deriva- 
tives up to order {d + 1) are uniformly bounded on [—{2k + 1)T, {2k + 1)T]'^ 
(actually one does not need here the convergence of these quantities); 

II 11^ 

(3) and ye 2^ is a Schwartz function on M.'^. 

Therefore, 

sup ||(l + ||y||)'D"/7„,i(y)||^<oo, 

te[0,l], neN, \a\<d+l, l<d+l 

SO by {d + 1) integration by parts, [(^^^((x)! is bounded by L (1 + ||x||)^'^^-'^, with 
L constant that only depends on k and on the behavior of the ipn^ and their 
derivatives up to order {d+2) on ^([-(2^4-1)19, (2^-^1)19]'^). Since {l+\\:si[\)-'^-^ 
is integrable on M'^, there is another finite constant K with the same properties of 
dependence and such that 



sup ( / |G„^t(x)|(ix ) 

),ll, neN VJRd / 



CK 



SO /3 < ./^ 



«e[o,i], neN 



To fix the ideas, set 9 = l/{2k + 1), and denote K^ the corresponding constant, which 
only depends on k and on the behavior of ip and the ip^s on Z'([— i, i]'^) (more precisely it 
suffices to have bound on their derivatives up to order d + 2). Then, 

lElf (Y..)] - G(F))l < 27(2. + 1) ,m (OJf^ ^ £^ 

In particular, |E[F(Y„)] — G(F))| = O f .^ ■)i"/2-e ) for any £ > 0, with a constant that only 
depends on e and M, C. D 
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4.2.2. Estimates for Gaussian regular domains. Starting from the previous Lemma, one 
can estimate the difference of probabihties |/i„(i?) — G{B)\ for B domain in M*^, but one 
needs B to be "sufficiently regular". The right condition to impose is the following. Denote 
i?*^ = {x I dijsijB) < s} for s > 0, and B~^ = {{B'^yy. When i? is a convex body, these 
are the Minkowski sum and the Minkowski difference of B with the ball -B(o,£) • 

Definition 4.5. A domain B gM.'^ will be called Gaussian regular with constant G if for 
all 6 > 0, 

1 r iixii2 . „ 1 



(27r)'^/2 



dx < Ge 



B^\B 



{2'kYI^ 



^ rfx < Ge. 



B\B- 



Example 4.6. Consider an hypercube, possibly with infinite bounds and possibly rotated 
by some u G SO((i, M): 



5 = MrQ[a(^\6(')]), MGSO(rf,M), a^^.h^^eR. 



vi=l 



As shown by Figure 5, the first integral in Definition 4.5 is bounded by 



2d 



X 

e~^ dx 



(x(2))2 + ... + (^(d))2 

e 2 dx < 



de. 



■4-4-- 

I I 
■i- ■ 



. 4 - -^ - 



B 



-I 

I 
I 
I 
I 
I 
I 
..I 

II II 



Figure 5. For hypercubes, the Gaussian mass of the e-boundary is 
bounded by the mass of 2d bands H x [0, e\ with H hyperplane, and therefore 
by a constant times e. 



The same bound holds for the second integral, so, hypercubes are Gaussian regular with 
constant ^21 tx d. More generally, the same discussion shows that any convex polytope 
with F faces on its boundary is Gaussian regular with constant 



27r 



Example 4.7. A more general set of domains for which our results will apply consists 
in convex bodies, the asymptotics of Laplace or Fourier transforms of the indicators of 
convex bodies being a well-known problem (see e.g. [Stc93]). The Gaussian regularity of 
convex sets follows for instance from [BRIO, Chapter 1, Theorem 3.1]; for completeness 
we give in Appendix 9.2 a quite shorter proof of this result. 
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Theorem 4.8 (Berry-Esseen estimates for probabilities of Gaussian regular domains). 
Fix 6 > 0. For n > n,; big enough and for all Gaussian regular domain B with constant 
G, 

\^,r.{B) - G{B)\ < -^ 

Here n^ depends on d, e, E and on the behavior of the ipn 's on E{[—i, i^), but is indepen- 
dent from B and G. 

Proof Fix k > \^]. For M > 0, set 

These are Lipschitz functions with constant M and bound G = 1. One has 



|2 



f e 2 

fin{B) - G{B) < /i„(V'B,M) - G(5) < \fln{i^B,M) " G{iPb,m)\ + / TTT^wTp ^^ 

llxlp 

[ e 2 

lJin{B) - G{B) > fJ.n{(pB,M) " ^(^) > -|/^n(0iJ.A/) - Gr{4>B.M)\ " / .^ ..,^ C^X. 

Jb\b-^/m [2n)'^/^ 

By hypothesis, the Gaussian integrals are bounded by ||, and we can use Lemma 4.4 in 
order to bound the remaining terms: 

2fc+3 2k+l 



(t„)SS M (i„)V2 

for any k > \^^~\ ■ It suffices now to choose the best M , which is 

2fc+2 

1^27 (2fc + 1)2 ^i±f 
and yields the inequality 

2fc+l (2fc+2)(2fc+3) 

|/x.(i?) - G{B)\ < 14.08 v/2^TI (2.+.)(2.+2) + 77^^, 

(t„) (4fc+3)(4fc+4) I'^nJ 

the co,.ta,. 14.08 co.esponding to the largest value of ^ SJf (^Iggi!) " (when 

fc = 1). Therefore, for n > ni^ with n^ depending only on k and on the behavior of the 

^„'sonr([-i,i]''), 

29 ^G{2k + l)d 

y 



\fXn{B) - G{B)\ < ^ (2fc+l)(2fc+2) ) 



(t^) (4fc+3)(4fe+4) 

making again some simplifications on the constants involved in this upper bound. Since 

d <2k — 1, setting £ = ^, this gives 



29 \/G 

\^4B) - G{B)\ < — / , -. 

"^ (in) 4"^ (t„)iS'= 4(4fc+3) 
. „ ^1 1 

For n big enough, -^(t„)^^'= 4(4fe+3) jg bigger than 1, so, possibly raising the value of 
^k = n^, we end up with 

\fin{B)-G{B)\< ^ 



(tn) 



I--' 
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and this holds for any Gaussian regular domain. D 



4.2.3. Estimates for hypercubes and polytopes. The previous result shows that for any 
6 > 0, there exists an integer rig such that 

1 



|/i„(C)-G(C)|<- 

(in) 4 

uniformly over hypercubes C in M.'^. However, in dimension rf = 1, we already know that 
the difference is in fact a uniform 0((t„)~^/^), so we might reasonably conjecture that the 
previous estimates are not optimal in this case. Indeed, the following is true: 

Theorem 4.9 (Berry-Esseen estimates for probabilities of polytopes). For every d-dimen- 
sional hypercube C = {'^i-i't^^^'Vi, i^*^ £ [a'^*\6'^*^]}, with the Vi's not necessarily orthogo- 
nal, and possibly with infinite bounds, 

f^niO = j^^ (X""^ (1 + (r>'(o) I x))dx^ +^(;^) ' 

with a remainder uniform on hypercubes. The same uniform estimate holds over convex 
polytopes with a bounded number of faces F. 

Remark 4.10. The estimate on multi- variables cumulative distribution functions can in 
turn be used to get better estimates on test functions, but with much stronger hypotheses 
than in §4.2.1. Hence, if F is a Schwartz function (or at least Schwartz "up to order d" in 
the same sense as in the proof of Lemma 4.4), then by integration by parts one can: 

• write fin{F) as the integral of the d-th derivative of F against the cumulative 
distribution function of Y„; 

• replace this cumulative distribution function by its Gaussian approximation; 

• and undo the d integrations by parts to obtain G(/) up to a 0((t„)^^/^), instead 
ofaO((tO~'/'+^)- 

However we need F to be much smoother than before, and this only removes the —e in 
the exponent of the estimates. 

Proof. The main difference between the polytope case and the previous computations is 
the possibility to use the monotony of the cumulative distribution function 

d 



F(x^^\...,x^'^^) =P 



Yn = J2y'-'^Vi, y«<x 



i=l 



/i„(C(a;«,...,a;(^))) 



with respect to all of its parameters. Denote 

G(a:W,...,x(^)) = -i^/ e-^(l + (v|x))rfx; 

[271 f'^ Jc{xm,...,xW) 

5(x(i),...,x('^)) = (F-G')(xW,...,x('^)) and r] = sup^.(i)^,..^^(d) |(5(a;«, . . . ,x('^))|. The d- 
dimensional analogue of [Fcl71, Lemma XVI. 3.1] yields 

M 
V < ^Vt + y^ 
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with 

r]T= sup \5t{x^'\...,x^'^^)\ 

xW,...,xid) 

= sup |(Aj'i,*F)(a:«,...,x('^))-(Aj'J,,*G)(x«,...,x('^))| 

where M is a multiple of a Lipschitz bound for G, that is to say a constant depending 
only on v. The monotony of F plays here a crucial role: indeed, Feller's proof uses the 
fact that if {x^^\ . . . , x^^'^) are parameters such that r] = 5{x^^\ . . . , x^'^^), then for a certain 
vector m G M*^, 

5(jW + e(i\...,j('^)+e('^))>F(x«,...,x('^))-G(J«+e«,...,5f('^) + eW)>| + (m|e) 

in a neighborhood Ylf^^[x^'^ -e^'\x^'^ +e^'^] of {x^^\ ..., JW) = (x^^) +e(^), . . . , x^'^^ + e^'^^) . 
Now, the same arguments as in Lemma 3.1 yield the uniform bound of the theorem for 
infinite hypercubes C{x^^\ . . . , x^*^-*) — notice that the term r]"'{0) disappears here since 
7]'" = 0. Then, for any convex polytope C bounded by F faces (including bounded hyper- 
cubes with F = 2d), one can use inclusion-exclusion to write Ic as a linear combination 
of 2^ cumulative distribution functions: 

/efaces{C) f&F{C) AcF{C) 

where lj+ and If- denote the indicator functions of the two half-spaces determined by 
the afiine hyperplane /, with the -|- side containing C; and for A subset of the set of faces 
F(C), C{A) is the corresponding infinite hypercube HfeA /"■ Since the bound o((t„)~^/^) 
for \fin{C) —G{C)\ is uniform over infinite hypercubes, we get from (13) the same uniform 
bound for convex polytopes with a bounded number of faces. D 



4.3. Estimates of probabilities of Borel sets in a multi-dimensional setting. We 

are now ready to prove the large deviations principle (12); again, we suppose A = E = I^, 
as this only amounts to easy changes of variables. Fix x 7^ 0, and as before, introduce a 
random variable X„ with probability 



^-K'^l-^l!^'-!^^!' 



where P„ is the law of X„. The characteristic function of X„ is ,, ^ , so modulo 



2 



exp I — h (t„x I z) 

which is the Laplace transform of a Gaussian random variable with mean t„x and co- 
variance matrix t„/d, it converges locally uniformly to i^/^f ■ Notice that if x stays in 

a compact subset of M'^, then the bounds measuring this convergence can be made in- 
dependent of x, so the previous theory applies locally uniformly with respect to x. In 
particular, for any convex polytope C, 



-'^-n ''71 ^ ^ 



V^ 



^V'(x) 



(C) 



^ ' (14) 



(tnY/ 



where G^/(x) is the signed measure with density ( 1 -|- V-^iP-, ) e 2 rfy. 
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We fix a polytope neighborhood i? of in the {d— 1) vector space orthogonal to x, and 
we consider the convex polytopes 

C(0, B) = {{X- l)x + Ab, heB, A > 1}; 

^(x, 5) = X + C(0, B) = {A(x + b), b G fi, A > 1}, 

see the following Figure. 



C(x,S) 




Figure 6. For cones C(x, B) with polytope basis B, one can approximate 
the probability P[X„ G tn C] by the conjectured upper bound, and this 
locally uniformly in x. 



Denote yU„ the law of 






and suppose ||x|| = x. One has 



P[X„ G t„C(x, B)] = 0„(x) / lyet„c(x,s) exp(- (y | x)) Q„((iy) 

= ^n(x) exp ( — ^j / luev^c(o,B) exp(-v^(u| x))/i„(rfu). 

By slicing the cone C(0, i?) orthogonally to the direction x, we see that the integral is 
also equal to 



where Cn,x is the part of ^yt^C{0,B) consisting of elements (/ix + (/x + ^/t^)h) with 
< /i < A. We can now replace fin{Cn,\) by its approximation (14), and as in dimension 
1, it suffices to take the Gaussian mass, as this is the only part that will give something 
bigger than o{l/^yt^). So, 

t„x^ exp(-At„x^) ( G{Cn,x) + o( 



X 



{2nY/' 



POO 

/ (A + v^) 
Jo 



y/tn 



d\ 



d~l 



2 A^x^ + (A + vO 
exp — At„x 



B 



dh dX 



up to o((t„) ^^'^) on the last line. 



Let 6 be function going to as its argument goes to zero, and such that the uniform 
remainder in the previous formula always satisfies 



1 



V^n 



< 



1 



y/inj 
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We suppose that B = -^ with 5* polytope of surface measure of order , \. e ( ^= 1 ; here 

by of order we mean equal up to a positive multiphcative constant bounded from below 
and from above. Under this assumption, the asymptotics of the integral are 

1 



/^surface!,-'-'/ 



<^e^ 



ytn V \^n 



(27r)^/-x 

and the estimate is of order (t„)'^/^~-'^~*^'^~^)/^/i(S'), that is to say of order -k=^\-k 

so larger than the remainder. Indeed, in the integral over S, ||b|| is always a o(l) by 
assumption on the size of the surface, and then one only has to compute a one dimensional 
Gaussian integral, which is done as in the proof of Theorem 3.2. So: 

Lemma 4.11. Fix x 7^ and a {d — 1)- dimensional polytope S orthogonal to x and of 
surface measure of order , \. e i -k= ) , with e chosen as before. One has 

S 



Z^i X 



exp ( ^ ) P 



X„ e tnC( X 
un to a remainder smaller than -4r= £^ '^ 






This leads finally to the analogue of Theorems 3.2 and 3.9 in several dimensions: 

Theorem 4.12. Fix a measurable part S of the A-sphere {x, ||Ax|| = b}, b > 0; and 
denote S^ = [1, +00) S. Locally uniformly in b, 



d 



P[X„ G [A^rS^] = (^)'exp (-^) (^ ^ rf/i.,rface(rfx)) (1 + 0(1)). 

Proof. Supposing again A = E = Id, it suffices to approximate the surface S by polytopes 
-1= with the hypotheses of the previous Lemma (see the next Figure). More precisely, we 
shall take two kinds of approximations: 

• outer approximations iSext, such that the projection of iSext onto the sphere is 
included into S and converges in measure to it; the disjoint union of polytopes 
5+t is always included into <S+ and lim /^^^^ ^(b) (i/isurface(b) = /^ V(b) (i/isurface(b). 

• inner approximations Sint, such that the projection of Sint onto the sphere contains 
S and converges in measure to it; the disjoint union of polytopes S^^ always 
contains 5+ and hm/^.^^ ^(b) (i/isurfacc(b) = fgi'ih) (i/i<,urface(b). 

In each case, one needs a number of polytopes of order larger than e~^ {tn)^'^~^'^^'^, so let 
us take an approximation with e~^ {tn)^'^~^^^'^ such polytopes. The sum of the remainders 
will be smaller in order than 

^(t„)'^=e(t.)'^/^-^=o((t„)'^/^-^). 



V n 

Then, for the outer approximations, 

P[X„ G (A„)'5+] > P[X„ G (A„)2 5+t] 

d 



^(fe)'-p(-^)(ll!g— ^(^+"(1)) 
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Figure 7. The two approximations of a part of the sphere by polytopes. 

where 6+ is the norm of the points that are the centers of the polytope faces of the 
approximation. Since the polytopes are of width o(l/y^), one can replace 6+ by b 
without changing the previous formula, which proves the lower bound 

liminf ( r^y (tnb) expf^ j P[X„ G {A^fB] j > /" ^(x) /isurfacc(rfx). 

Taking inner approximations, one obtains the same upper bound for the linisup. D 

Theorem 4.13. Let B be a Borelian subset ofW^. With the notations of Formula (12), 

limsup ( \^\ {tj>) exp(^^ j P[X„ G [A^^fB] J < j V^(x) /i,,rfacc(f^x) 

with equality if for instance B = C is dosed and the function Fc of Lemma 4-2 is con- 
tinuous at r = b. 

Proof. These are now the same arguments as for the toy-model, cf. §4.1. D 



5. First examples 

The general results of Sections 2-4 can be applied in many contexts, and the main 
difficulty is then to prove for each case that one has indeed the estimate on the Laplace 
transform given by Definition 1.1. An explicit formula for this characteristic function, 
which is mainly accessible for sums of i.i.d. variables (§5.1), is fortunately not required. 
Therefore, the development of techniques to obtain mod-0 estimates becomes an interest- 
ing part of the work. In probabilistic number theory, this will usually be related to the 
Selberg-Delange method (§5.2), whereas for random combinatorial objets, we will have 
to combine the methods of §3.2 with some new tools (Sections 6-8). In this section, we 
detail examples for which the mod-0 convergence has already been proved before {cf. 
[JKNll, DKNll]). 
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5.1. Sums of independent random variables. Though the theory of inod-^ conver- 
gence is meant to be used with complex random variables (statistics of random combi- 
natorial objects, arithmetic properties of random integers, sums of dependent random 
variables, etc.), it already gives interesting results for sums of independent and identically 
distributed random variables. Let X be a random variable in M'' with entire characteristic 
function, and (X„)„gN be independent copies of it. The sum S„ = Yl'i=i ^i ^^^ character- 
istic function e"'^"^''^\ so Z„ = n^^^^Sn converges modulo a Gaussian of mean n^/^E[X] 
and variance matrix n^/'^ Var[X], with limiting function 

^{z) = exp ( 

In particular, the following moderate deviation principle holds: assuming E[X] = and 
X truly (i-dimensional, i.e., A = var[X] = E[X®^] is symmetric positive definite, one has 

P rS„ > ^2/36 ~ ^ l^e ^ / exp ^-^ ^— ^.urface rfz 

where E = A^^. This estimates holds for b of order bigger than n~^^^, and up to order 
Q^^i/12^ according to the discussion of §3.2. If X is symmetric in law (X and —X have 
same law), then the third cumulant vanishes and one has to look for the fourth cumulant: 
hence, the random walk S„ satisfies in this case the moderate deviation principle 

P[||rS„|| , „»/%] . (!^e-^ (/ exp ('^^lP9p^) ,_„.)) 

(27r)2 \Js''--L(o,i) V ^4 / / 

for b of order bigger than n~^^^, and up to order o(n^/^^). 

A simple consequence of these mult i-dimensional results is the loss of symmetry of the 
random walks on Z'^ conditioned to be far away from the origin; this loss of symmetry 
has also been brought out in dimension 2 in the recent paper [Benl3]. Thus, consider the 
simple 2-dimensional random walk S„ = X]r=i-^«' 'where Xj = (±1,0) or (0, ±1) with 
probability 1/4 for each direction. The previous discussion and the values of cumulants 

^(4)((S)fJX)®4) = ft:(4)((CJX)®^) = K^^\{^Xf\ (53X)®2) = -- 

leads to the following limiting result: if X„ = RnG^^" with 9n G (0, 27r), then 
lim F\e„ e {Oi, 62) I Rn > rrr^'^] = / F{r, 9) d9 



exp 



with F{r, 9) = \ rUsin2oyA ^^ drawn hereafter. 

Jo t^^Pi 96 j'^^ 

This function gets concentrated around the two axes of M^ when r — t- 00, whence a loss 
of symmetry in comparison to the behavior of the 2-dimensional Brownian motion (the 
scaling limit of the random walk). In dimension d > 3, one obtains the similar result 



hm P 

n— >oo 



^" GA 



IS 



|S.„|| > rn' 



3/4 



K{r) fexpl-^ J2 (x«x(^))M/.3urface(rfx) 



l<i<j<d 

for any measurable set A C §^^^^(0, 1). The conditional probability is therefore concen- 



trated around the axes of M . All these estimates hold up to r = o{n 



1/12\ 
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Figure 8. The function F{r, 6) measuring the loss of symmetry of S„ when 
conditioned to ||S„|| = 0{n^^^) (using Mathematica®). 

Another similar setting in which our Theorems apply readily is the Poisson approxima- 
tion of a sum of Bernoulli variables with "small" parameters pk- Set X„ = X]fe=i ^iPk)y 
where B{p) is the random variable equal to 1 with probability p and to with proba- 
bility 1 — p; and where these Bernoulli variables are supposed independent. Under the 
assumptions J2T=iPk = +c« and J2T=iiPkY < +oo, the Laplace transform of X„ writes 
as 

n / oo \ 

E[e^^"] = J](l + pk{e^ - 1)) = e*-(<=^-i) 1[{1 + pu{e^ - 1)) e'^^^^'-^) (l + o(l)) , 
fc=i \fc=i / 

where t„ = X]fc=iPfc ~^ ^^- The infinite product il){z) is indeed convergent, because 
(1 +p,(e^ - l))e-^'=(^^-^) = 1 - ^(e^ - 1) + o{{puf) 

and Yl^=i{Pkf' is finite. So, one has mod-Poisson convergence, and by Theorem 2.4 the 
Poisson approximation holds with precise large deviations 

p-tn({l+£)log(l+£)-e) ^/TTP7 

nXn >{l + e) t.] ^ ^== ^^-— He). 



5.2. Logarithmic combinatorial structures. The previous example is a toy-model 
for the so-called logarithmic combinatorial structures, see [ABT03, FS90]. Non-trivial 
examples falling in this framework are the number of cycles of a random permutation 
(Example 2.12), and the number of distinct prime divisors of a random integer or of a 
random polynomial over a finite field. Denote uj{k) the number of distinct prime divisors of 
an integer fc, and w„ the random variable uj{k) with k random integer uniformly chosen in 
[n] = {1, 2, . . . , n}. The random variable a;„ satisfies the Erdos-Kac central limit theorem 
(c/. [EK40]): 

g; -log log n^^^^^^^ 
Vlog log n 
Indeed, the Selberg-Delange method (see [Ten95, §2.5] and the next paragraph) yields 
E[e^'^"] = eii°siogn+j){c^-i) ^^^^z _ ^^ ^^^ ^^z _ 1) (1 + o(i)) ^ 
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where 7 = 0.577... is the Euler-Mascheroni constant, and IIa{x) = YlaeAi^ + f ) ^~^ ^^^ ^ 
part of N* = |1, +00]. This niod-Poisson convergence leads now to the principle of large 
deviations: 

PK > (1 + £)(loglogr2 + 7)] ~ ^ np(£) Un*ie). 

y'ZTT log log n e 



Similarly, denote Un^q the number of distinct irreducible divisors of a random nionic 
polynomial of degree n over the finite field F^ with q elements. It is shown in [KNIO, 
Theorem 6.1] that the characteristic function of ujn,q has for asymptotics: 



E[e^--.l ~ e('°s«)(e^-i) ^.^ T\(l- -7^1 (l + 



gdcgTT _ I 

where the product runs over monic irreducible polynomials over F^. Hence, if Iq^d = 
2 X]e|d/^(^) 1'^^'^ ~ li ~^ 0{q'^/'^) is the number of irreducible polynomials of degree d, then 

E[e.-.].e(-)'-"^n(l-^) (1 + ^) . 

from which one deduces that P[a;„,g > (1 + e) logn] is equivalent to 

logn{(l+e)log{l+£)-e) ^/fip^ ^(( e\( 1 Y \'^'''' 



I I I I I -I 1 I I 

id 



n iH-^ 1 



VSvrlogn er(l + £) \}^\\ gv V 9 

In particular, the cardinality q only plays a role in the large deviations of lOn^q-, but not in 
the central limit theorem. 



5.3. Additive arithmetic functions of random integers. The previous example con- 
cerning the number of prime divisors of a random integer can be generalized as follows. 
Let / : N — !■ M be an arithmetic function with the following properties: 

(i) / is additive: jivnn) = f{m) + f{n) ii m A n = 1; 

(ii) /(p*^) = g{p, k) is bounded as a function of prime numbers p and integers k (we shall 
denote C a bound); 

(iii) f{p) = g{p, 1) = 1 + O(^) for some a > 0. 

For the last condition, by dividing / by some constant, one can assume more generally / 
"almost constant on primes". Denote then 



F{s,w) = J2 



W'' 



n=l 



the Dirichlet series of {w^^'^^)n£N, which is well-defined for w E C^ and a = 3fJ(s) > 1. 
One forms the ratio 

F{s,w) 
G[s,w] = -—r\ — , 

where C,{s) is the Riemann zeta function 'Y^^=i ~^- The choice of complex logarithms 
implied by the writing C,{s)^ is the one of [Tcn95, II.5.1]. On the other hand, it is well- 
known that there exists a constant | > c > such that (^{s) does not vanish on the 
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domain 
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D 



s = o" + ir G C 



a> 1- 



1 + log+ \t\ 



C <s 



a > 



see e.g. [Ten95, Theorem 3.15]. We shall assume c < a in the following. 



Proposition 5.1. Suppose w = e^ with \'^{z)\ < -^ (hence, w stays in a circular band). 
The map s i— )■ G{s,w) is holomorphic on the domain D. 

Proof. Since / is additive, one can write for a > 1 the Euler product 



F(s,^)=n(i 

pGP 



W 



gip^) 



pS 



nf(P) 



+ 



w- 



gip,2) 



P 



,2s 



+ 



Notice that by hypothesis on w and s, 1 + ^^^-^- is non zero for all p G P, since 



w 



fip) 



pS 



< 






< 2^«(^) < 1. 



Therefore, the Euler product can be rewritten as 

F{s,w) = F{s,w) n(l 



w 



f{p) 



pS 



where F{s,w) is uniformly convergent and holomorphic on the domain a > ^. Similarly, 

with Z{s,w) holomorphic on the domain a > ^, and non-vanishing on D. It remains 

therefore to prove that 

/(p) 



1 + — 

peP P' 



is itself holomorphic on the domain D. This is clear, since the logarithm of the general 
term log ( 1 + ^^^-^ ) — log ( 1 + :^ j behaves as 

D 



!^ (e^(/(p)-i) _ 1] 



w / z 
~ — O 



p° p° 

and 3f?(s) + a > 3f^(s) + c > 1 on the domain D. 



K(s)+a 



The previous Proposition allows one to apply [Ten95, II. 5, Theorem 3], which is a 
Tauberian theorem for '^k<:n'^^^^^ ■ Set 



^ oo 

Y{sM = -{{s-i)as)r = Y. 



lA^) 



is -I] 



3=0 



which is an analytic function in s for |s — 1| < 1. 



Proposition 5.2. Denote X„ a random integer uniformly chosen in [l,n]. One has the 
mod-Poisson convergence 



n 



^/(fc)=e(c^-i)i 



- E 



fc=i 



.fc=0 



(logn) 



+ 



(logn) 
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where the coefficients Xk{w) are defined by 

^ ^ h+j=k ■' 

In particular, \q{w) = lim.^i ri§(^^- 

Corollary 5.3. For \e\ < -^, one has the asymptotics 

-(loglogn)((l+e)log(l+£)-£) /I I ^ 

P[/(X„) > (1 + e) loglogn] ^ ,^ , , ^^—- Xo{e). 

y/zir log log n e 

The same estimates have essentially been obtained by Radziwill in [Rad09]. On the other 
hand, our reasoning makes clear the link between the notion of mod-Poisson convergence, 

which is a statement on the ratio 

E[e^^"l 



) 



(E[e^^])^"^ 
and the Selberg-Delange method, which is a statement on the ratio 

F(s,e^) 
with F the Dirichlet series of an additive arithmetic function. 

5.4. Statistics of random combinatorial objects and singularity analysis. We 

now present the method of singularity analysis of generating series which is in spirit 
comparable to the Selberg-Delange method introduced above but which is used to cover 
a variety of situations which do not fall in the domain of applications of the Selberg- 
Delange method. The idea is to use contour integration techniques in order to extract 
the asymptotic behavior of the coefficients of some generating series of interest. As in 
the Selberg-Delange method described in the previous section, we shall have two complex 
variables and the coefficients of our generating series can be themselves the moment 
generating functions of some random variables. Rather than stating general and abstract 
results, we would rather illustrate the power of this method (and at the same time the 
fact that mod-Poisson convergence is genuinely a higher order central limit theorem) by 
considering the example of the total number of cycles of random permutations under 
the so called weighted probability measure. For more details the reader can look at the 
monograph [FS09] or the paper [FO90]. 

From now on we shall follow the presentation in [NZ13]. Denote X„(cr) the number of 
disjoint cycles (including fixed points) of a permutation a in the symmetric group ©„. We 
write Cj {a) for the number of cycles of length j in the decomposition of cr as a product 
of disjoint cycles. Let O = (^m)»n>i be given with 9^ > 0. The generalized weighted 
measure is defined as the probability measure Pe on the symmetric group ©„: 






m=l 

with hn a normalization constant (or a partition function) and Hq = 1. This model 
is coming from statistical mechanics and the study of Bose quantum gases (see [NZ13] 
for more references and details). It generalizes the classical cases of the uniform measure 
(corresponding to dm = 1) and the Ewens measure (corresponding to the case 6m = > 0). 
It has been an open question to prove a central limit theorem (as in the Ewens measure 
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case) for the total number of cycles X„ under such measures (or more precisely under 
some specific regimes related to the asymptotic behavior of the Om^s). The difficulty is 
arising from the fact the there is nothing such as the Feller coupling anymore and that 
the probabilistic methods fail to hold here. We now show how the method of singularity 
analysis allows us to prove mod-Poisson convergence, and hence the central limit theorem, 
but also distributional approximations and precise large deviations. We first consider the 
generating series 

oo „ 
n=l 

It is well known that 

oo 

^/i„r = exp(^0(t)). 

n=l 

Note that in general /i„ is not known. Our goal is to obtain an asymptotic for hn and 
for the moment generating function of X„. We note r the radius of convergence of ge{t)- 
The idea of singularity analysis is to introduce a properly chosen holomorphic function 
S{t,w) in a domain containing {{t,w) G C^; |t| < r, \w\ < f}, where f > is some 
positive number, and then to consider the function F{t,w) = exp{wg{t))S{t,w) with the 
goal of extracting precise asymptotic information (with an error term) for the coefficient 
of t" in the series expansion as powers of t for F{t,w). This can be carried out if one 
makes suitable assumptions on the analyticity properties of g together with assumptions 
on the nature of its singularity at the point r on the circle of convergence. This motivates 
the next definition: 

Definition 5.4. Let < r < R and < < 7r/2 be given. We then define 

Ao = Ao{r, R, (j)) = {z e C; \z\ < R,z j^r,\ aj:g{z - r) | > 0}. 

Assume we are further given g{t), 6 > and r > 0. We then say that g{t) is in the class 
Hr,d) if 

(i) there exists R> r and < < 7r/2 such that g{t) is holomorphic in Ao(r, i?, 0); 
(a) there exists a constant K such that 

git) = e\og{- —\+K + 0{t-r) ast^r. 

One readily notes that the generating series corresponding to the Ewens measure {i.e. 
6m = 0) is of class J^(l, 9) since in this case 

1 



geit) = e\og^ ^_^ 

Consequently our results will provide alternative proofs to this case as well. The next 
theorem due to Hwang plays a key role in our example (we use the following notation: if 
G{t) = Y.n=o9ni'\ we denote [r][G] = g^ the coefficient of T in G{t)). 

Theorem 5.5 (Hwang, [Hwa94]). Let F{t,w) = exp{wg{t))S{t,w) be given. Suppose that 

(i) g(t) is of class J^{r,0), 

(a) S{t,w) is holomorphic in a domain containing {{t,w) G C^; |t| < r, \w\ < f}, where 
f > is some positive number. 
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THgti 

r"- \ i [dwj \n 

uniformly for |tf;| < f and with the same K as in the definition above. 

The idea of the proof consists in taking a suitable Hankel contour and to estimate the 
integral over each piece. There exist several other versions of this theorem where one can 
replace log(l — t/r) by other functions and we refer the reader to the monograph [FS09], 
chapter VI. 5. As an application of Theorem 5.5, we obtain an asymptotic for /i„. 

Lemma 5.6. Let > and assume that ge{t) is of class J^{r,6). We then have 

Proof. We have already noted that Xl^i ^nt"' = Q^p{ge{t)) ■ We can apply Theorem 5.5 
with g(t) = ge(t), w = 1 and S{t,w) = 1. D 

Now using elementary combinatorial arguments (which are detailed in [NZ13]), one can 
prove for each w G C the following identity as formal power series: 

oo 

^/i„Ee[exp(wX„)]r = exp{e^ ge{t)) . (15) 

n=0 

Using this identity and Theorem 5.5, we can show: 

Theorem 5.7 (Nikeghbali-Zeindler, [NZ13]). If gs{t) is of class F{r,6), then 

Ee[exp(u,A-„)l^„»<--.e-.--)(j^ + o(i)). 

Consequently the sequence (X„) converges in the mod-Poisson sense with parameters K + 
9 log n and limiting function pJ L . 

Proof. An application of Theorem 5.5 yields 

r||exp(e",eM)] = ^^(rJ^ + o(- 

with O(-) uniform for bounded w G C Now a combination of identity (15) and Lemma 
5.6 gives the desired result. D 

The above theorem not only implies the central limit theorem, but also Poisson ap- 
proximations and precise large deviations. We only state here the precise large deviations 
result which extends earlier work of Hwang in the case 6* = 1 as a consequence of Theorem 
2.4 and refer to [NZ13] for the distributional approximations results. 

Proposition 5.8 (Nikeghbali-Zeindler, [NZ13]). Let Y^ = X„ — 1 and let x eM. such that 
tnX G N with tn = K + 6 logn. We note k = tnX. Then 

tJ, ( no) .^(i 



P|K„ = .t„| = e-'.^^^ + 0^^ 

In fact an application of Theorem 2.4 would immediately yield an arbitrary long ex- 
pansion for P[F„ = 3;t„] and also for P[F„ > xt„] since the speed of convergence is fast 
enough. 
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5.5. Characteristic polynomials of random matrices in a compact Lie group. 

Introduce the classical compact Lie groups of type A, C, D: 

U(ra) = {g e GL{n, C) | gg'^ = g^ g = In} (unitary group) 

USp(n) = {g E GL(n,EI) | gg* = g*g = In} (compact symplectic group) 

S0(2n) = {g E GL{2n,'R) \ gg^ = g^g = l2n ', det g = 1} (special orthogonal group) 

where for compact symplectic groups g* denotes the transpose conjugate of a quaternionic 

matrix, the conjugate of a quaternionic number a + ib +ic + kd being a — ib — ib — kd. In 

the following we shall consider quaternionic matrices as complex matrices of size 2n x 2n 

by using the map 

., . , , f a + ib c + id\ 
a + iO + jc + kd h-). . . 

y—c + 1(3 a — lb J 

The eigenvalues of a matrix g E G = S0(2n) or U{n) or USp(n) are on the unit circle S^, 
and for g E G random taken under the Haar measure 

logdet(e'^ - g) =(iaw) log det e'^ + logdet(l - g) 

'm^ + logdet(l-^) ifG' = U(n), 
2ni^ + logdet(l - g) if G = S0(2n) or USp(n), 

so the study of the random characteristic polynomials det(z — g) reduces to the study of 

y„^'^'° = logdet(l-(7), 

which is complex-valued in type A and real-valued in type C, D. In the unitary case, 
we shall identify C and M^ in order to be able to use the multi-dimensional framework 
introduced in §4. The mean of Yn is 

(O for U(n); 

Wn]=\l^og^ forUSp(n); 
[ilog^ forSO(2n), 

and it is proved in [KN12, DKNll] (after [KSOOa, KSOOb, HKOOl]) that X„ = Yn-E[Yn] 
converges in the mod-Gaussian sense with parameters 

_f^ forU(n) 
^" ~ [log f for USp(n) and S0(2n) 

and 

r ^(l+(^+i^V2) G(l+(.-i.)/2) f^^u(n) 

V- = I cS^ for USp(r2) 

[cm&^) forS0(2n). 

Here G denotes Barnes' G-function, which is the entire solution of the functional equation 
G{z + 1) = T{z) G{z) with G(l) = 1. Thus, our large deviations theorems apply and one 
obtains: 



Theorem 5.9. Fix x > 0. OverUSp{n), 



P 



P 



det(l-^) > \l^n'2- 



G(3/2) 

exp 



det(l — g) < \h:'n2 



^ --X 



xG{3/2 + x)y/2TT\ogn 

g(3/2) 
xG{3/2-x)y/27T\ogn 



exp 



x' 


\og2n 


2 
X 


2 
log 2n ' 
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OwerSO(2n), 



Pn det(l-^) > V8nn 

Fn det(l -g) < Vs^n 
Finally, over\J{n), 



G(l/2) f xMog2n 

exp 



2 ^ 



xG(l/2 + x) V27rlogn "^ V 2 

G(l/2) / a;2 iQg2^ 

exp ' 



xG(l/2-x)v/2Zbgn 



P„[l logdetd - 9)1 > .logn] == (- /^^ G(l + 2.Lg) ''"J -P(-^log")- 

Proof. These are immediate computations by using Theorem 3.2 in type C and D, and 
Theorem 4.12 in type A. In this last case, we also get 

P„[ I det(l -g)\> rf] = P„[3fJ(logdet(l - g)) > xlogn] 

Gil + x)' 1 , 2, ^ 

— T^T T ^— exp(— a; logra) 

G{l + 2x) V4vrlognx ^^ ^ ' 

by restriction to the first variable. D 

The analogue of Theorem 5.9 in the setting of random matrices in the /3-ensembles, or 
of general Wigner matrices, has been studied in the recent paper [DE13]. They can be 
easily restated in the mod-Gaussian language, since their proofs relie on the computation 
of the asymptotics of the cumulants of the random variables X„ = log | det M„|, with for 
instance {Mn)nen random matrices of the gaussian unitary ensembles. 

6. DEPENDENCY GRAPHS AND MOD-GAUSSIAN CONVERGENCE 

Dependency graphs are a classical tool in the literature to prove convergence in distri- 
bution towards a Gaussian law of the sum of partly dependent random variables. They 
are used in various domains, such as random graphs [JLROO, pages 147-152], random 
polytopes [IV07], patterns in random permutations [BonlO]. As dependency graphs give 
a natural framework dealing in a uniform way with different kinds of objects, a natural 
question is the following: when we have a dependency graph with good properties, can we 
obtain more precise or other kinds of results than the convergence in distribution? Here 
is a brief presentation of the literature around this question. 

• In [BR89], P. Baldi and Y. Rinott give precise estimates for the total variation 
distance between the relevant sequence of random variables and the Gaussian 
distribution. 

• In [,Ian04], S. Janson has established some large deviation result involving the 
fractional chromatic number of the dependency graph. 

• More recently, H. Doring and P. Eichelsbacher have shown how dependency graphs 
can be used to obtain some moderate deviation principles [DE12, Section 2]. 

Here, we shall see a link between dependency graphs and mod-Gaussian convergence. 
This gives us a large collection of examples, for which the material of this article gives 
automatically some precise moderate deviation results. Our deviation result has a larger 
domain of validity than the one of Doring and Eichelsbacher — see below. 
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In this section, we establish a general result involving dependency graphs (Theorem 6.3). 
In the next two sections, we focus on examples and derive the mod-Gaussian convergence 
of the following renormalized statistics: 

• subgraph count statistics in Erdos-Renyi random graphs (Section 7); 

• random character values from central measures on partitions (Section 8). 

6.1. The theory of dependency graphs. Let us consider a variable X, which writes 
as a sum 

of random variables Y^ indexed by a set V. 

Definition 6.1. A graph G with vertex set V is called a dependency graph for the family 
of random variables {Y^, a G V} if the following property is satisfied: 

If Vi and V2 are disjoint subset of V such that there are no edges in G 
with one extremity in V\ and one in V2, then the sets of random variables 
{Ya}a£Vi (^'f^d {Ya\a<^V2 ^'^^ independent (i.e., the a-algebras generated by 
these sets are independent) . 

Note that a family of random variables may admit several dependency graphs. In 
particular, the complete graph with vertex V is always a dependency graph. However, this 
is not interesting: the sparser is the dependency graph, the better are the resulting bound. 
From a dependency graph, one can derive bounds on the cumulants of X] see [Jan88, 
Lemma 4] 

Theorem 6.2. For any integer r > 1, there exists a constant Cr with the following 
property. Let {Ya}a&v be a family of random variables with dependency graph G. We 
denote N = \V\ the number of vertices of G and D the maximal degree of G. Assume 
that the variables Ya have all finite moments and that there exists a constant A such that, 
for all a E V, 

\\Y4r = (E[(y;)n)'/'- < A. 

Then, if X = Yla^a, one has: 

K^''\X) < Gr N {D + ly-^ A' . 

This theorem is often used to prove some central limit theorem. In [DE12], Doring 
and Eichelsbacher have strengthen this theorem, showing that Gr = (2e)''(r!)'^. Then 
they have used this new bound to obtain some moderate deviation results. Here, using 
combinatorial arguments, we show the following: 



Theorem 6.3. Theorem 6.2 holds with Gr = 2*' ^ 






We shall see in next Section that this stronger version can be used to establish mod- 
Gaussian convergence and, thus, precise moderate deviation results. 

6.2. Joint cumulants. There is a multivariate version of cumulants, called joint cumu- 
lants, that we shall use to prove Theorem 6.3. We present in this paragraph its definition 
and basic properties. Most of this material can be found in Leonov's and Shiryaev's 
paper [LS59] (see also [JLROO, Proposition 6.16]). 
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6.2.1. Preliminaries: set-partitions. We denote by [n] the set {1,. . . ,n}. A set partition 
of [n] is a (non-ordered) family of non-empty disjoint subsets of S (called parts of the 
partition), whose union is [n]. For instance, 

{{1,3, 8}, {4, 6, 7}, {2, 5}} 

is a set partition of [8]. Denote 0.{n) the set of set partitions of [n]. Then Q.{n) may be 
endowed with a natural partial order: the refinement order. We say that vr is finer than 
vr' or vr' coarser than vr (and denote vr < vr') if every part of vr is included in a part of vr'. 

Lastly, denote /i the Mobius function of the poset Q.{n). In this paper, we only use 
evaluations of /i at pairs (vr,{[n]}) (the second argument is the partition of [n] in only 
one part, which is the maximum element of £l{n)), so we shall use abusively the notation 
yu(vr) for yu(vr, {[t^]}). In this case, the value of the Mobius function is given by: 

/i(vr) = /i(vr, {[n]}) = (-l)#W-^(#(vr) - 1)!. (16) 

6.2.2. Definition and properties of joint cumulants. If Xi,...,Xr are random variables 
with finite moments on the same probability space (denote E the expectation on this 
space), we define their joint cumulant by 



/.(Xi, . . . , X,) = [ti ■ ■ ■ tr] log U [^t.X.+-+t.X.] \ . 



(17) 



As usual, [ti . . . tr\F stands for the coefficient of ti ■ ■ ■ t^ in the series expansion of F in 
positive powers of ti, . . . ,tr- Note that joint cumulants are multilinear functions. In the 
case where all the Xj's are equal, we recover the r-th cumulant k^'''\X) of a single variable, 
see Section 1. Joint cumulants can be expressed in terms of joint moments, and vice-versa: 

E[Xi---X,]= Y, \[ti{X,-ieC)- (18) 

7reQ(r) CStt 



n^^ 



(19) 



«:(Xi,...,X,)= Y. ^i{^r)\[^ 

7re£!(r) Ce-n UsC 

In these equations, C G vr shall be understood as "C is a part of the set partition vr". 
Recall that /i(vr) has an explicit expression given by Equation (16). For example the joint 
cumulants of one or two variables are simply the mean of a single random variable and 
the covariance of a couple of random variables: 

fi:(Xi) = E[Xi] ; fi:(Xi, X^) = E[XiX2] - E[Xi] E[X2]. 

For three variables, one has 

k{X,,X2, X3) = EiXiXsXs] - E[XiX2] E[X3] - E[X,X,] E[X,] 
- E[X2X3] E[Xi] + 2 E[Xi] E[X2] E[X3]. 

6.2.3. Statement with joint cumulants. Let {Falaey be a family of random variables with 
dependency graph G. As in Theorem 6.2, we assume that the variables Y^ have all finite 
moments and that there exists a constant A such that, for all a & V, 

\\Yjr = mYaY]f'' < A. 

Consider r subsets Vi, V2,. . .Vr oi V, non necessarily distinct and set Xj = Xloey ^" (for 
i G [r]). We denote Di the maximal number of vertices in Vi adjacent to a given vertex 
(not necessarily in V)- Then one has the following result. 
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Theorem 6.4. With the notation above, 

\k{Xi, ...,Xr)\< 2'-' r"--^ \Vi\ {D2 + !)■■■ {D,. + 1) A^ 

The proof of this theorem is very similar to the one of Theorem 6.3. However, to simphfy 
notations, we only prove the latter here. 



6.3. Useful combinatorial lemmas. We start our proof of Theorem 6.3 by stating a 
few lemmas on graphs and spanning trees. 

6.3.1. A functional on graphs. In this Section, we consider graphs H with multiple edges 
and loops. We use the standard notations V{H) and E{H) for their vertex and edge sets. 
For a graph H and a set partition vr of V{H), we denote tt ± // when the following is 
true: for any edge {i,j} G E{H), the elements i and j lie in different parts of vr (notation: 
i oo^ j). We introduce the following functional on graphs H: 



F^ = (-1)1^^)1-1 ^^( 



TT±H 



Lemma 6.5. For any graph H , one has 

y^ (_i)ii?i-iv(H)i+i_ 



■ H 



ECE{H) 
(V(H),E) connected 

Proof. To simplify notations, suppose V{H) = [r]. We denote l(p) the characteristic 
function of the property (P). By inclusion-exclusion, 

(_i)im)i-iF^= y: ( n i^^Jm"^= ^ ( n (i-i-'-..))m-) 

7rei3(r) \{i,j)&E{H) J 7rei3(r) \(i,j)eE(//) / 

= E E (-1)"" ( n i.-w) f'i-) 

ECE{H) TTeQ{r) \{i,j)&E J 



E (-1)'"' 

EcEiH) 



Yl /^(^) 



vr such that 
.V(i,j)6S, i~^j 



But the quantity in the bracket is unless the only partition in the sum is the maximal 
partition {[t"]}, in which case it is 1. This corresponds to the case where the edges in E 
form a connected subgraph of H. D 



Corollary 6.6. The functional F h fulfills the deletion- contraction induction, i.e., if e is 
an edge of H, then 

where H\e (respectively H/e) are the graphs obtained from H by deleting (resp. contract- 
ing) the edge e. 

Proof. The first term corresponds to sets of edges containing e, and the second to those 
that do not contain e. D 
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This induction determines Fh together with the initial conditions: 

F. = l, 

Fh = if if is disconnected. 

Corollary 6.7. For any graph H, the quantity Fh is nonnegative and less or equal than 
the number ST// of spanning trees in H . 

Proof. The quantity ST// fulfills the same induction as F// with initial conditions: 

ST. = 1, 

ST„ =ST =••• = !, 

ST// = if if is disconnected. □ 

Remark 6.8. Both F// and ST// are actually specializations of the bivariate Tutte polyno- 
mial TH{x,y) of H {cf. [Bol98, Chapter X]): 

F// = T//(1,0) ; ST//=T//(1,1). 

This explains the deletion-contraction relation. As the bivariate Tutte polynomials has 
non-negative coefficients, it also explains the inequality < F// < ST//. 

6.3.2. Induced graphs containing spanning trees. Fix a graph G (typically the dependency 
graphs of our family of variables). For a list (f i, . . . , f^) of r vertices of G, we define the 
induced graph G[vi, . . . ,Vr\ as follows: 

• its vertex set is [r]; 

• there is an edge between i and j if and only if Vi = Vj or Vi and Vj are linked in G. 

We will be interested in spanning trees of induced graphs. As the vertex set is [r] , these 
spanning trees may be seen as Cayley trees. Recall that a Cayley tree of size r is by 
definition a tree with vertex set [r] (Cayley trees are neither rooted, nor embedded in the 
plane, they are only specified by an adjacency matrix). These objects are enumerated by 
the well-known Cayley formula established by C. Borchardt in [Bor60]: there are exactly 
r^~^ Cayley trees of size r. 

Lemma 6.9. Fix a Cayley tree T of size r and a graph G with N vertices and maximal 
degree D. The number of lists {vi, . . . ,Vr) of r vertices of G such that T is contained in 
the induced subgraph G[vi, . . . , Vr] is bounded from above by 

Proof. Lists {vi, . . . ,Vr) as in the lemma are constructed as follows. First choose any 
vertex Vi among the A^ vertices. Then consider a neighbor j of 1 in T. As we require 
G[vi, . . . ,Vr] to contain T, they must also be neighbor in G[vi, . . . ,Vr], that is to say that 
Vj = f 1 or Vj is a neighbour of Vi in T. Thus, once Vi is fixed, there are at most D + 1 
possible values for Vj. The same is true for all neighbors of 1 and then for all neighbors 
of neighbors of 1 and so on. D 

We have the following immediate consequence. 
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Corollary 6.10. Let G be a graph with n vertices and maximal degree D and r > 1. The 
number of couples ((f i, . . . , f^), T) where each Vi is a vertex of V and T a spanning tree 
of the induced subgraph G[vi, . . . ,Vr] is bounded above by 

r"^-^ N {D + lY-\ 



6.3.3. Spanning trees and set partitions of vertices. Recall that STh denotes the number 
of spanning trees of a graph H. Consider now a graph H with vertex set [r] and a set 
partition tt = (tti, . . . ,TTt) of [r]. For each i, we denote ST^^{H) = ST//[^^] the number 
of spanning trees of the graph induced by H on the vertex set ttj. We also use the 

multiplicative notation 

t 

i=i 
We can also look at the contraction H/n of H with respect to tt. By definition, it is the 
multigraph [i.e. graph with multiple edges, but no loops) defined as follows. Its vertex set 
is the index set [t] of the parts of tt and, for i ^ j, there are as many edges between i and 
j as edges between a vertex of ttj and a vertex of ttj in H. Denote STt^{H) = STfj/n the 
number of spanning trees of this contracted graph (multiple edges are here important). 
This should not be confused with ST^{H): in the latter, vr is placed as an exponent 
because the quantity is multiplicative with respect to the part of tt. 

Note that the union of a spanning tree T of H/tt and of spanning trees Tj of H[TTi] (for 
1 < i < t) gives a spanning tree T of H. Conversely, take a spanning tree T on H and a 
bicoloration of its edges. Edges of color 1 can be seen as a subgraph of H with the same 
vertex set [r]. This graph is of course acyclic. Its connected components define a partition 
TT = {vTi, . . . , TTf} of [r] and edges of color 1 correspond to a collection of spanning trees Tj 
of H[7Ti\ (for 1 < i < t). Besides, edges of color 2 define a spanning tree T on H/tc. 

Therefore, we have described a bijection between spanning trees of H with a bicoloration 
of their edges and triple (vr,T, (Tj)i<j<i) where: 

• TT is a set partition of the vertex set [r] of H (we denote t its number of parts); 

• T is a spanning tree of the contracted graph H/ir; 

• for each 1 < z < t, Tj is a spanning tree of the induced graph H[7ii]. 

Before giving a detailed example, let us state the numerical corollary of this bijection: 

r~^ STh = J2 ST.(^) ST^{H), (20) 

TV 

where the sum runs over all set partitions tt of [r]. 

Our bijection is illustrated on Figure 9, with the following conventions: 

• red dotted edges belong to the graph but not to the spanning tree; 

• blue plain edges are edges of color 1 in the tree; 

• green dashed edges are edges of color 2 in the tree. 
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T 




D ^ <^ 



TT = {tti, 7r2} with TTi = {1, 2, 3}, 7r2 = {4, 5, 6}; 



Ti 



T 



<2) (4: 




I 




5) T6) 



Figure 9. Bijection explaining Identity (20). 

Note that in this example the graph H/tt has two vertices linked by four edges. These 
four edges correspond to the edges (1,4), (1,6), (2,6) and (3,5) of H. In the example, 
the spanning tree T is the edge {(3, 5)}. If we had chosen another edge, the tree T on the 
left-hand side would have been different. Hence, in the Equality (20), the multiple edges 
of iJ/vr must be taken into account: in our example, ^T.^{H) = 4. 



6.4. Proof of the bound on cumulants. Recall that we want to find a bound for 
K^'^\X). As X writes X = Xlaey ^cn ^^ ^-^Y ^^^ joint cumulants and expand by multi- 
linearity: 

«:W(X)= Yl <Y^.,---,yc..)- (21) 

ai,...,ar 

The sum runs over lists of r elements in V, that is vertices of the dependency graph G. 
The proof consists in bounding each summand K,{Ya-i^, . . . ,Ya^), with a bound depending 
on the induced subgraph G[ai, . . . , ar]- 

6.4.1. Bringing terms together in joint cumulants. Recall the moment-cumulant formula 



(19) which states K,(Ya^, 



, J^ar, 



X;^/i(7r)M^, where 



M^ 



Bevr 



.ieB 



We warn the reader that the notation M^^ is a little bit abusive as this quantity depends 
also on the list («!,...,«,.). By hypothesis, G is a dependency graph for the family 
{Ya}a&v- Hence if some block i? of a partition vr can be split into two sub-blocks Bi and 
B2 such that the set of vertices {ai}i^Bi and {ai}i^B2 are disjoint and do not share an 
edge, then 



E 



Therefore, M^ 



M, 



<I>hM 



jeB 
where H 



E 



JeBi 
G[ai, . 



X E 



(22) 



ieB.2 

ar] and 0//(vr) is the refinement of n 



obtained as follows: for each part ttj of tt, consider the induced graph if [ttj] and replace ttj 
by the collection of vertex sets of the connected components of Hlni]. This construction 
is illustrated on Figure 10. 
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H 




For example, consider the graph H here opposite and 
the partition vr = {7ri,7r2} with tti = {1,2,3,4} and 
7r2 = {5,6}. Then -f/^[7ri] (respectively, H[tt2\) has 
two connected components with vertex sets {1, 2} and 
{3,4} (resp. {5} and {6}). Thus 

0H(vr) = {{l,2},{3,4},{5},{6}}. 



Figure 10. Illustration of the definition of ipn- 



We can thus write 



k(G) = Y^ M^, 



^7re0„^(7r') 



^(tt) 



Fix it' = {tt[, . . . , ttQ and let us have a closer look to the expression in the parentheses 
that we will call ajr'- To compute it, it is convenient to consider the contraction H/tt' of 
the graph H with respect to the partition vr'. 

Lemma 6.11. Let tt' be a set partition of [r]. If one of the induced graph H[tt'j] is 
disconnected, then a,r' = 0. Otherwise, a^^i = Fh/tt'- 



Proof. The first part is immediate, as (p^^ 



-'(tt') 



in this case. 



If all induced graphs are connected, let us try to describe 0^^(7r'). All set partitions vr 
of this set are coarser than vr', so can be seen as set partitions of the index set [r] of the 
parts of tt'. This identification does not change their Mobius functions, which depends 
only on the number of parts. Then, it is easy to see that n lies in 0^^(7r') if and only if 
TT is coarser than vr' and two elements in the same part of tt never share an edge in H/tt' 
(here, tt is seen as a set partition of [r]). In other words, vr lies in (J)JI{tt') if and only if 
TT _L {H/tt'). This implies the Lemma. D 



Consequently, 



l^iXai, ■ ■ ■ , Yar) — 2_^ ^^' '^H/tt' I J_ J_ '^H[tt'^ connected 



(23) 



,i=l 



where the sum runs over all set partitions vr' of [r] 



6.4.2. Bounding all the relevant quantities. The following bound for M^r follows directly 
from Holder inequality and the assumption ||l^a||r < ^ (for every a & V): 

\M^\ < A''. (24) 

Finally, to bound each summand ^{Ya^, . . . , Ya^), we shall use the following bounds: 

I Fh/tt' I < STh/vt' by Corollary 6.7; 

lH[7r^] connected < ^'Fh[tt'.] ■ 

Thus, Equation (23) gives 



\^{Y^„...,Y^^)\<A'- Y, STha' n^Ti/K] =A'-2'-'STh, 



(25) 



,i=l 
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the last equality corresponding to Equation (20). Using Equation (21), we get that the 
cumulant k^'^\X) is smaller than T'^^ A!" times the number of couples ((q;i, . . . , a^),^) 
where (ai, . . . , a^) is a list of vertices of the dependency graph G and T a spanning tree 
of the induced subgraph (^[ai, . . . , a^\. 

Corollary 6.10 now ends the proof of Theorem 6.3. 



7. Subgraph counts in Erdos-Renyi random graphs 

In this section, we consider Erdos-Renyi model r(n,p„) of random graphs. A random 
graph r with this distribution is described as follows. Its vertex set is [n] and for each pair 
{«, j} C \n\ with i ^ j, there is an edge between i and j with probability p„. Moreover, all 
these events are independent. We are then interested in the following random variables, 
called subgraph count statistics. If 7 is a fixed graph of size k, then X!f' is the number of 
copies of 7 contained in the graph r(n,p„) (a more formal definition is given in the next 
paragraph). This is a classical parameter in random graph theory; see, e.g. the book of 
S. Janson, T. Luczak and A. Rucinski [JLROO]. 

The first result on this parameter was obtained by P. Erdos and A. Renyi, cf. [ER60]. 
They proved that, if 7 belongs to some particular family of graphs (called balanced), one 
has a threshold: 

JimP[X, >0]-|^ ifn-VM7) = ,(p„), 

where m{'j) = \E{'~f)\/\V{'y)\. This result was then generalized to all graphs by B. Bollobas 
in [BolOl], but the parameter 771(7) is in general more complicated than the quotient 
above. Consider the case n"^/™^'''-' = o(p„), when the graph T{n,Pn) contains with high 
probability a copy of 7. It was then proved by A. Rucinski (see [Ruc88]) that, under the 

additional assumption n^(l— p„) — )■ cxd, the fluctuations oi X-f' are Gaussian. This result 
can be obtained using dependency graphs; see e.g. [JLROO, pages 147-152]. 

Here, we consider the case where p„ = p is a constant sequence (0 < p < 1). The 
possibility of relaxing this hypothesis is discussed in Section 7.3.3. Denote a„ = n"^ and 
Pn = n'^"^, where k is the number of vertices of 7. It is easy to check that 

E[X(")] = ca„/3„ ; Var(X(")) = a^ a„ (/3„)2 

for some positive constants c and a — see, e.g., [.JLROO, Lemma 3.5]. Hence, Rucihski's 
central limit theorem asserts that, if T ~ x^/ct^ for some fixed real x, then 



lim P 

n—^oo 



xr^E[xn>y 



x/a 

e-^"'^du. 
27r J -00 



Using Theorem 6.3, we shall extend this result to a framework where x tends to infinity, 
but not to quickly: (a„)^^^ < T < (ttn)^/"^. 

Theorem 7.1. Let < J9 < 1 and 'j be a graph with k vertices. We consider X^ the 
number of copies of '~f contained in Erdos-Renyi random graph r{n,p). Let an and /3„ be 
defined as above. 
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(1) The renormalized variable [X^i —W[X^]) / [a-ri (3n) converges vaod- Gaussian with 
parameters tn = o"^ (Xn and limiting junction ^'{z) = exp (f-z^), where a and L 
are computed in §7.3.2. 

(2) Besides, if (a^)-^/^ ^ T ^ (a;„)'^'''^, the following precise moderate deviations 
principle holds: 



P 



Jn 



>T 



e 2-"-" / LT^ 



27ra2 



T2 



'='p(6^)(^ + °«)- '2^' 



A similar result has been obtained by H. Doring and P. Eichelsbacher in [DE12, Theorem 
2.3]. However, 

• their result is less precise as they only obtain the equivalence of the logarithms of 
the relevant quantities (in particular, when we look at the logarithm, the second 
factor of the right-hand side is negligible); 

• and their proof works for T <^ n^/^ while ours is valid for T <^ n?/"^. 

Unfortunately, we cannot get deviation results when T ~ xn^ for some real number x; this 
would correspond to evaluate P[X^" > (1 + e) E[X|" ]]. For large deviations equivalents 
of 

logP[X(")>(l + £)E[X(")]] 

in) 

with X-^ the subgraph count in a Erdos-Renyi random graph, there is a quite large 
literature, see [CVll, Theorem 4.1] and [Clial2] for recent results in this field. As we 
consider deviations of a different scale, our result is neither implied by, nor implies these 
results. Note, however, that their large deviation results are equivalents of the logarithm 
of the probability, while our statement is an equivalent for the probability itself. 



7.1. A bound on cumulants. 

7.1.1. Subgraph count statistics. In the following we denote 2t(n, A;) the set of arrange- 
ments in [n] of length fc, i.e.., lists of k distinct elements in [n\. The cardinality of 2t(?7., k) 
is the falling factorial n'^^ = n{n — l) ■ ■ ■ {n — k+l). Let A = (ai, . . . , a^) be an arrangement 
in [n] of length k, and 7 be a fixed graph with vertex set [k]. Recall that F = F(ra,p„) is 
a random Erdos-Renyi graph on [n]. We denote S^{A) the following random variable: 

6,iA) = l^ if7CF[a„...,a,]; ^^7) 

I else. 

Here F[ai, . . . , a^] denotes the graph induced by F on vertex set {oi, . . . , a^}. As our data 
is an ordered list (oi, . . . , a^), this graph can canonically be seen as a graph on vertex set 
[k], that is the same vertex set as 7. Then the inclusion should be understood as inclusion 
of edge sets. 

For any graph 7 and any integer n > 1, we then define the random variable X^" by 

Ae2l(n,|7l) 
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Remark 7.2. It would also be natural to replace in Definition (27) the inclusion by an 
equality 7 = r[ai, . . . , a^]. This would lead to other random variables Y.y"' , called induced 
subgraph counts. Their asymptotic behavior is harder to study (in particular, fluctuations 
are not always Gaussian; see [.JLROO, Theorem 6.52]). Notice however that if 7 is a 
complete graph, then the two definitions correspond. Hence, our results include the 
numbers of fc-cliques in a random Erdos-Renyi graph. 

7.1.2. A dependency graph of the subgraph count statistics. Fix some graph 7 with vertex 
set [k]. By definition, the variable we are interested in writes as a sum 

Ae2l(n,fc) 

We shall describe a dependency graph for the variables {S.y{A)}Ae<n{n,k)- 

For each pair e = {v, v'} C [n], denote Jg the indicator function of the event: e is in the 
graph r{n,p). By definition of the model r{n,p), the random variables Je are independent 
Bernouilli variables of parameter p. Then, for an arrangement A, denote E{A) the set of 
pairs {v,v'} where v and v' appear in the arrangement A. One has 

6,{A)= H h, 

where E'{A) is a subset of E{A) determined by the graph 7. In particular, if, for two 
arrangements A and A', one has |-E(A) H E{A')\ = (equivalently, \A fl A'\ < 1), then 
the variables S^{A) and S^{A') are defined using different variables I^. (and, hence, are 
independent). This implies that the following graph denoted B is a dependency graph for 
the family of variables {S^{A)}Ae<n(n,k)'- 

• its vertex set is 2t(ra, k); 

• there is an edge between A and A' if 1^4 n A'l > 2. 

Considering this dependency graph is quite classical - see, e.g., [JLROO, Example 1.6]. 

All variables in this graph are Bernouilli variables and, hence, bounded by 1. Besides 
the graph B has A^ = n^'^ vertices, and is regular of degree D smaller than 

,\ 2 
J 2 (n - 2){n - 3) ... (n - A; - 1) < k^n''-\ 

Indeed, a neighbour A' of a fixed arrangement A G 2t(n, k) is given as follows: 

• choose a pair {aj, Oj} in A that will appear in A'; 

• choose indices i' and j' such that a'-, = ai and a'-, = aj (these indices are different 
but their order matters); 

• choose the other values in the arrangement A'. 
So, we may apply Theorem 6.3 and we get: 

Proposition 7.3. Fix a graph 7 of vertex set [k]. For any r <1, one has 

UW(x("))| < 2'-^r'-^ n'' {k* n^~^Y-\ 
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7.2. Polynomiality of cumulants. 

7.2.1. Dealing with several arrangements. Consider a list (A^, . . . , A'') of arrangements. 
We associate to this data two graphs (unless said explicitly, we always consider loopless 
simple graphs, and V{G) and E{G) denote respectively the edge and vertex sets of a 
graph G): 

• the graph Ga has vertex set 

Vi = V{Ga) = {(t,0 I 1 < * < r, 1 < t < A;,} 

and an edge between [t,i) and {s,j) if and only if a] = a{. It is always a disjoint 
union of cliques. If the A^'s are arrangements, then the graph Ga is endowed with 
a natural proper r-coloring, (t, i) being of color i. 

• the graph if™ has vertex set [r] and an edge between i and j if |y4* fl A^\ > m. 

Notice that if]^ is the contraction of the graph Ga by the map (f : {t,i) ^-^ i from the 
vertex set of Ga to the vertex of Hj^. Indeed, 

(i, j) e E{Hl) ^ 3vi G ip'\i),Vj e ip~\3) such that {v,,Vj) e E{Ga). 

An example of a graph Ga and of its (1-) contraction H^ is drawn on Figure 11. For 
m > 2, the definition of H^ is less common. We call it the m-contraction of Ga. The 
2-contraction is interesting for us. It corresponds exactly to the graph induced by the 
dependency graph B on the list of arrangement A, considered in the proof of Theorem 6.3. 
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Figure 11. The graphs Ga, H\ and H\ corresponding to the family of 
arrangements (2,5,6,4,1), (2,7,9,4), (4,16,3), (8,10,5) (15,13,14,11,12). 



Remark 7.4. Graphs associated to families of arrangements are a practical way to encode 
some information and should not be confused with the random graphs or their induced 
subgraphs. Therefore we used greek letters for the latter and latin letter for graphs Ga 
and their contractions. The dependency graph will always be called B to avoid confusions. 
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7.2.2. Exploiting symmetries. The dependency graph of our model has much more struc- 
ture than a general dependency graph. In particular, all variables S.y{A) are identically 
distributed. More generally, the joint distribution of 

depends only of the 2— contraction of G^. Here, we state a few consequences of this 
invariance property that will be useful in the next Section. 

Lemma 7.5. Fix a graph 7 of size k, the quantity 

E[6^{A') ■ ■ ■ 6^{A')] 

depends only on the graph Ga associated to the family of arrangements {A^, . . . , A^). The 
same is true for the joint cumulant k, {6.y{A^), . . . , 6^{A^)) . 

Proof. The first statement follows immediately from the invariance of the model T{n, p) by 
relabelling of the vertices. The second is a corollary, using the moment-cumulant relation 
(19). D 

Corollary 7.6. Fix some graph 7. Then the joint cumulant K{X!f' , . . . ,X!f') is a poly- 
nomial in n. 

Proof. Using Lemma 7.5, we can rewrite the expansion (21) as 

«:(X("),...,X(«)) = 5^«:(G)iVG, (28) 

G 

where: 

• the sum runs over graphs G of vertex set Vjk that correspond to some arrangements 
(that is G is a disjoint union of cliques and, for any s, t and i, there is no edge 
between {s,i) and {t,i)); 

• K,{G) is the common value of k {6^{A^), . . . , (5^(y4'')), where {A^, . . . , A"^) is any list 
of arrangements with associated graph G; 

• Ng is the number of lists of arrangements with associated graph G. 

But it is clear that the sum index is finite and that neither the summation index nor 
the quantity n{G) depend on n. Besides, the number Nq is simply the falling factorial 
n{n — 1) . . . (n — C(7 + 1), where Cg is the number of connected components of G. The 
corollary follows from these observations. D 



7.3. Moderate deviations for subgraph counts. 

7.3.1. End of the proof of Theorem 7.1. We would like to apply Corollary 3.6 to the 
sequence Sn = Xj — E[X|" ] with «„ = n^ and /3„ = n'^~^. Let us check that Sn indeed 
fulfills the hypothesis. 

(1) The uniform bound \K^^\Sn)\ < (Cr)''a„(/9„)'', where C does not depend on n, 
corresponds to Proposition 7.3; we may even choose G = 2k'^. 
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(2) We also have to check the speed of convergence: 

«:(^)(5„) = a2aJ/3„)2(l + o(«;^/i^)) ; K^'\Sr.) = La^{P^f {I + o{a-''')). (29) 

But these estimates follow directly from the bound above for r = 2, 3 and the fact 
that K^^'\Sn) is always a polynomial in n — see Corollary 7.6. 

Finally, the mod-Gaussian convergence follows from the observations in Section 3.2 and 
we can apply Corollary 3.6 to get the moderate deviation statement. This ends the proof 
of Theorem 7.1. D 

Remark 7.7. Using Theorem 6.4, we could obtain a bound for joint cumulant of subgraph 
counts. Hence, it would be possible to derive mod-Gaussian convergence and moderate 
deviation results for linear combinations and vectors of subgraph counts, in the spirit of 
Section 4. However, as we do not have a specific motivation for that and as the statement 
for a single subgraph count statistics is already quite technical, we have chosen not to 
present such a result. 



7.3.2. Computing a"^ and L. The proof above does not give an explicit value for cr^ and 
L. Yet, these values can be obtained by analyzing the graphs G that contribute to the 
highest degree term of k^^-* and k^'^'. 

Lemma 7.8. Let 'y be a graph with k vertices and h edges. Then the positive number a 
appearing in Theorem 7.1 is given by 

a^ = 2hy-\l-p). 

Proof. By definition, cr^ is the coefficient of n^'^"^ in k^'^^X!^^ ). As seen in Equation (28), 
the quantity n^'^^X!^^ ) can be written as 

G 

where the sum runs over some graphs G with vertex set l^ U \^. However, we have seen 
that k{G) = unless the 2-contraction H^ of G is connected — see Inequality (25) — and 
on the other hand, Nq is a polynomial in n, whose degree is the number Cg of connected 
component of G. 

As we are interested in the coefficient of n'^^~'^, we should consider only graphs G with 
at least 2k — 2 connected components and a connected 2-contraction. These graphs are 
represented on Figure 12. 




Figure 12. Graphs involved in the computation of the main term in k^'^\X. 



7 / 
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Namely, we have to choose a pair of vertices on each side and connect each of these vertices 
to one vertex of the other pair (there are 2 ways to make this connection, if both pairs 
are fixed). A quick computation shows that, for such a graph G, 



<G) 



V 




2/1-1/ 



1 — p) if both pairs correspond to an edge of 7; 
else. 



Finally there are 2h graphs with a non-zero contribution to the coefficient of n in 
k(2)(X^")). For each of these graph, Ng = n{n - I) . . . {n - 2k + ?,) = n^'^^'^il + o(l)). 
Therefore, the coefficient of n^'^"^ in k^'^\X^i) is 2/i^j9^'^^^(l — p), as claimed. D 



The number L can be computed by the same method. 

Lemma 7.9. Let 'y be a graph with k vertices and h edges. Then the number L appearing 
in Theorem 7.1 is given by 

L = 12h%h - l)p^'''\l - pf + Ah^p^^-^{1 -p){l- 2p). 

Proof. Here, we have to consider graphs G on vertex set ^ U ^ U \^ with at least 3fc — 4 
connected components and with a connected 2-contraction. These graphs are of two kinds, 
see Figure 13. 





Figure 13. Graphs involved in the computation of the main term in k^'^\X^ 



7 . 



In the first case (left-hand side picture), an edge on the left can possibly have an 
extremity in common with an edge on the right. In this case, one has to add an edge 
to complete the triangle (indeed, all graphs G are disjoint unions of cliques). But this 
cannot happen for both edges on the left simultaneously, otherwise the graph belong to 
the second family. 

The following is now easy to check. There are 12h^{h — 1) graphs of the first kind with 
a non-zero cumulant k{G) — 3 choices for which set V plays the central role, h^{h — 1) 
for pairs of vertices and 4 ways to link the chosen vertices — and, for these graphs. 



the corresponding cumulant is always k{G) 



p 



3/1-2, 



1 — pY ■ Similarly, there are Ah^ 



graphs of the second kind with a non-zero cumulant k{G). For these graphs, k{G) 



p 



,3/1-2/ 



1 — p)(l — 2p). In both cases, Nq 



n 



3A:-4/ 



1 + o{l)). This completes the proof. D 



Example 7.10. Denote T„ the number of triangles in a random Erdos-Renyi graph r(n,p). 
According to the previous Lemmas, the parameters cr^ and L are respectively 



a^ = 18p^{l-p) and L = 108/ (1 -p)(7 - 8p). 
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Moreover, E[T„] = -n}^ p^ = n^ p^ — Sn^ p^ + 0{n). So, 

^rr^ ^ ^ 9/ -^M 1 ( v'^ (7-8p)v^ 

^ ^ ^^ 7367rp5(i_p)^2 ^\^ 36p5(i_p) 3n72p(l-p)^ 

for 1 < t; < n^/2. 

7.3.3. Case of non-constant sequences pn- Proposition 7.3 still holds when p„ is a non- 
constant sequence (the particularly interesting case is p„ — )■ 0). One can even sharpen a 
little this bound by replacing Inequality (24) by the trivial bound 

\M^\ < (pn)^ 

where h is the number of edges of 7. Doing that, we get 



\K 



(r) 



(X("))| < CV^-2n'■('=-2)+2(p„)^ 



7 

But, unlike in the case Pn = P constant, this bound is not always optimal (up to a 
multiplicative constant) for a fixed r. For example, if Pn = n'"^ with e > sufficiently 
small, then — see [JLROO, Lemma 3.5] — 

Var(X^")) ~ const, x n^^-^^p^f^-^ < r^2^-2(p„)^ 

Finding a uniform bound for cumulants, whose dependence in r is of order {CrY (so that 
we have mod-Gaussian convergence), and which is optimal for fixed r is an open problem. 

Yet, we can still give some deviation result. Let a;„ and /?„ be defined as follows: 

«n = n2(Pn)''-'(l-Pn)'; 



With these choices, one has 



UM(x("))|<(cV)^a„(/3„)' 



Unfortunately the convergence speed hypotheses (29) are not always satisfied, so one 
cannot apply Corollary 3.6 in general. However, one can see that k^'^\X^ ) and k'^^\X^ ) 
are polynomials in n and p„ of degree 2h and 3/i in pn- Thanks to this observation, if p„ 
is of order n~^ with < £ < l/6/i, then Conditions (29) are satisfied and Theorem 7.1 
still holds in that case. 



8. Random character values from central measures on partitions 

The combinatorial techniques introduced in Sections 6 and 7 can also be used to study 
certain models of random integer partitions. Recall that if G is a finite group and if r is 
a function G — )■ C with t^Cg) = 1 and T{gh) = T{hg) (a trace on the group), then r can 
be expanded uniquely as a linear combination of normalized irreducible characters: 



J2^r[X]x' 



xeG 



This yields a probability measure P,- on the finite set G of isomorphism classes of irre- 
ducible representations of G. This spectral measure is a priori complex-valued. When 
G = &{n) is the symmetric group of order n, the irreducible representations are la- 
belled by integer partitions of size n, that is non-increasing sequences of positive integers 
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A = (Ai, . . . , A^) with Yli=i ^i ~ ^- ^"^ th^ following we denote '^{n) = &{n) the set of 
integer partitions of size n, and ^(A) = i the length of a partition. 

Definition 8.1. A central measure on partitions is a family (Pr.rJneN of spectral measures 
on the sets ^(n) that come from the same trace of the infinite symmetric group (5(oo). 
Thus, there exists a trace r : (5(oo) — > C such that 

r|6(n)= Y. P-."W^^- 

Example 8.2. The regular trace r(cr) = l(j=id corresponds to the Plancherel measures of 
the symmetric groups, given by the formula Pn[A] = (dim V"'*')^/n!, where V^ is the ©(n)- 
irreducible module of label A. They have been extensively studied in connection with 
Ulani's problem of the longest increasing subsequence and with random matrix theory, 
see e.g. [BDJ99, BOOOO, OkoOO, IO02]. 

A central measure (PT,n)neN is non-negative if and only if {T(pipJ ))i<i,j<n is hermitian 
non-negative definite for any finite family of permutations pi, . . . ,Pn- The set of non- 
negative central measures, i.e., coherent systems of probability measures on partitions 
has been identified in [Tho64] and later studied in [KV81]. Call extremal a non-negative 
trace on ©oo that is not a positive linear combination of non-negative traces. Then, 
extremal central measures are labelled by the infinite-dimensional Thoma simplex 



n = ^00 = {a, 13) = ((ai > as > ■ ■ ■ > 0), (A > /^s > ■ ■ ■ > 0)) 
The trace on the infinite symmetric group corresponding to a parameter u is given by 



OO s 

5^a^ + A<U. 



cc 



rM= Yl P\c\{uj) withpi(w) = l, pfc>2(w) = ^(a,)'= + (-l)'=-^(A)^ 

ceC{a) i=l 

C{a) denoting the set of cycles of a. In particular, if p and z/ are two permutations with 
disjoint non-trivial cycles, then for any extremal trace on ©(oo). 

This property is known as asymptotic factorization, and it is involved in most of the 
asymptotic results on central measures. Kerov and Vershik have shown that if w G fi and 
p G ©(oo) are fixed, then the random character value X^{p) with A chosen according to 
the central measure P^^^n converges in probability towards the trace T^{p). Thus, central 
measures on partitions and extremal traces of ©(oo) are concentrated. More recently, it 
was shown by Feray and Meliot that this concentration is Gaussian, see [FM12, Mell2]. 
The aim of this Section is to use the techniques of §6-7 in order to prove the following: 

Theorem 8.3. Fix a parameter a; G fi, and a permutation p G ©(oo). Denote Xp the 
random character value X''^{p), where: 

• A G '^{n) is picked randomly according to the central measure F^^^n, 

• and x'^ denotes the normalized irreducible character of label A of &{n), that is 

tr o'^( o) 
^ ^P^ ~ ~A- — \7\ with (y , p ) irreducible representation of G{n). 
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The rescaled random variable r?!"^ (Xp — T^{p)) converges in the mod-Gaussian sense 
with parameters tn = n^^^ a"^ and limiting function tjj{z) = exp(L ^), where o"^ and L can 
be computed explicitly in terms of u and p. In particular, if p is a k-cycle, then 

cr^ = P {p2k-i{uj) -Pki^jf); 

L = e {{3k - 2) P3fe-2(w) - (6A; - 3) P2k-i{oo) Pk{oo) + (3A; - 1) pkiu)^) . 

Corollary 8.4. The random character value XJ^ = Xp with p a k-cycle satisfies the 
principle of moderate deviations 

.,2 

^3 



4"'-P.HI>-rV^.] ^-|;^cosh(^) (1^ „(!)), 



P 

for 1 ^ ,T ^ n^^^ , with a^ and L as in Theorem 8.3. 



Actually we shall see during the proof that there is mod-Gaussian convergence for any 
finite vector of rescaled random character values, so the results of Section 4 holds. This 
Section is organized as follows. In §8.1, we introduce renormalized conjugacy classes and 
their cumulants, and we show that these cumulants are polynomials in n. In §8.2, we then 
prove bounds on these cumulants similar to those of Proposition 7.3, and we compute the 
limits of the second and third cumulants. This will allow us to use in §8.3 the framework of 
§3.2 in order to prove the results stated above. We shall also detail some consequences of 
these results for the shapes of the random partitions A ~ P„^(^ viewed as Young diagrams, 
in the spirit of [FM12, Mcll2]. 

8.1. Renormalized conjugacy classes. In the whole Section, a; G f2 is fixed. Given a 
partition p = (/ii, . . . , pi) of size \p\ = ^j^^ /^« ~ ^' ^^ denote 

^H,n = 2_^ ('^1,1) • • • ;CH,Aii)('3'2,l5 • • • , '^2,^2) ' ' ' ('^^,1! • • • ! '^^./^J) 

where the formal sum is taken over arrangements in 2t(ri, k) and is considered as an 
element of the group algebra C(5(n). For A G 2l(n, A;), it will be convenient to denote 
Pp{A) the corresponding term in Sp^n- 

\{n> k., then Z'^^„ is a multiple of the conjugacy class that consists in elements of cycle 
type (yUi, . . . ,pi, 1""'^), and it contains n^^ elements; otherwise it is equal to 0. We shall 
need the following facts about these renormalized character values, for which we refer to 
[IK99, Sni06b, FM12]: 

(1) In the center of the group algebra C&{n), a polynomial in elements Z'^„ is a 
linear combination with integer coefficients of Su,nS, and the coefficients of the 
expansion are independent of n. This can be seen by constructing a projective 
limit of the algebra of symbols Up^n, see [IK99]. An explicit rule to compute 
products of symbols Ep^n is given in [FM12, §3.4]. 

(2) For any (random) partition A G ^(^), we can consider the random variable 

Sp{X) = x\Sp,n)=n^^''^X^-K 

where pp is any permutation of cycle type p (assuming k < n). For any polynomial 
P(i7^(i) „, . . . , Z'^(r) „), it is the same to consider the random variable 
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or to first expand the polynomial P(Z'^(i) „, . . . , Z'^(r) ,^) = J^u ^f ^i',n in C&{n), 
and then evaluate 

V 

(3) Notice that the expectation under P„^^ of a random character value y^i^p) is r^i^p). 
For permutations pi, P25 • • • , Pr in S(n), we define: 

k(pi, . . . , p^) = ^ p(7r) n ^-^ I n ^^ 

where the sum runs over set-partitions vr of [r]. Note that the last product is 
not commutative. By convention, we always multiply the permutations pi by 
increasing order of their index. This choice is recorded by the arrow above the 
product sign. The previous remarks show that if p^, . . . , p^ are fixed partitions of 
respective sizes ki, . . . ,kr, then 



n^'^---n^'^ K(^Xlrl...,X^^l)=K{E,^,...,E, 



7rei3(r) Be-rr \ieB 

Aie2l(n,fci)7reQ(r) Beir \ieB 

A''e2l(n,fcr) 

= Y. ^{pAa'),---,pAa1)- (30) 

A''e2l(n,fc,.) 

On the second line of this list of equalities, one has a polynomial (with constant coeffi- 
cients) in traces Tij{IJi,^n), because each product Hies ^fi\n can be expanded as a sum of 
symbols S^^n. However, for every z/, T^{I^u,n) = n'^^'^^ p^{uj) is a polynomial in n, so: 

Lemma 8.5. Fix some integer partitions p}, . . . , p^. Then the rescaled joint cumulant 

n{E,., . . . , s,r) = n^^^ . . . n^'^ /.(x^;;), . . . ,x(;:)) 

is a polynomial in n. 

This is the analogue of Corollary 7.6. Using now the last line of the list of equalities (30), 
we are going to bound these joint cumulants. 

8.2. Bounds and limits of the cumulants. 

8.2.1. The dependency graph of the random character values. In a noncommutative alge- 
bra s^ endowed with a trace r, recall that subalgebras £^\ are said independent if they 
commute and if 

r(aia2 . . . a^) = r(ai)r(a2) ■ ■ ■ r^ak) 

when ttj G £^\^ with Aj 7^ \j for i ^ j. That said, consider a non-commutative algebra 
£/ endowed with a trace E : ^ — > M that satisfies Equation (22) for a block of variables 
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B that can be split in two blocks Bi and B2 of independent non-commutative random 
variables. The reader can check that the reasoning of Section 6 can be performed with 
any family of random variables in ^. In other words, one can work with abstract random 
variables in a non-commutative probability space, that are defined without an underlying 
notion of randomness; and the estimates on cumulants will still hold. We shall therefore 
work with the non-commutative probability space (C(5(n), r^j), that indeed satisfies (22). 
li k = |/i|, we are interested in the non-commutative random variables 

Ae2t(n,A:) 

Again, to control the cumulants, we shall exhibit a dependency graph for the families of 
random variables {p/^(^)}Aea(n,A:)- By the property of asymptotic factorization, if A and 
A' are disjoint, then Pfj,{A) and p^{A') are independent (in the non-commutative sense, 
w.r.t. the trace t^^). Therefore, one can associate to {p;i(A)}yiea(n,fe) the dependency 
graph B defined by: 

• its vertex set is 2l(ra, k); 

• there is an edge between A and A' if |yl fl A'l > 1. 

The graph B is obviously regular with degree strictly smaller than fc^ ri'^''^^. On the other 
hand, the expectations of its vertices are all smaller than 1 in absolute value. So one can 
once again apply Theorem 6.3 and we get: 

Proposition 8.6. Fix a partition fi of size k. For any r < 1, one has 



\n 



W(r^)| < 2''-V'^-2n^^'(A;V=-i)'-i; 



\ n 
The multi-variate version of this reads then as: 



|^M(xW)|<r^-2 [1 



Proposition 8.7. Let fi^, . . . , fi"^ be integer partitions of sizes ki, . . . ,kr. The following 
bound holds: 

^{x(^.,---,xj-i)\<k,---kr{k, + --- + k^y-' (^IJ \ 



8.2.2. Limits of the second and third cumulants. Because of Lemma 8.5 and Proposition 
8.7, for any fixed integer partitions, 

K (x(;;), . . . ,x(;;)) n'-^ ^ /. (r^i, . . . , s^r) ^^i+-+^.-(-i) 

converges to a constant. Let us compute this limit when r = 2 or 3; we use the same 
reasoning as in §7.3.2. 

As K is invariant by simultaneous conjugacy of its arguments, the summand in Equation 
(30) depends only on the graph G = Ga associated to the collection A = {A^, . . . , A^), 
and we shall denote it k{G). We fix partitions /i^, . . . , /i'^ of respective sizes ki, . . . ,kr, 
and write 

G 
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When r = 2, we have to look for graphs G on vertex set Vt = [ki\ U [k-2\ with 1-contraction 
connected and at least /ci + /c2 — 1 connected components, because these are the ones that 
will give a contribution for the coefficient of n^i+^-2-'^. For i G [^(/U^)] and j G [£(/i^)], 
denote 

{l? N /i2)(2, j) = (/il \ /il) U (/i^ \ ^]) U ill] + /i^ - 1}. 

This is the cycle type of a permutation p^i{A^) p^2[A'^), where Ga is the graph with one 
edge joining an element of A^ in the cycle of length n] with an element of A"^ in the cycle 
of length /i|. These graphs are the only ones involved in our computation, and they yield 

k{G) = p(^i^^2)(jj)(cj) -p^iu^2(a;), 
where for a partition fx we denote p^{uj) the product Y[i=i P^J.^i^)■ So, 
Proposition 8.8. For any partitions ji and v, the limit of n K(Xp^ , Xp^) is 

i=l j=l 

In particular, for cycles fi = (k) and u = {[), 

lim n K(Xf\xl''') = kl [pk+i-i{uj) -pk,i{uj)). 

n—>-oo \ / 

When r = 3, we look for graphs G on vertex set Vjk = [ki] U [k2] U [k^] with 1-contraction 
connected and at least ki + k2 + ks — 2 connected components. They are of three kinds: 

(1) One cycle in p^2(A^) is connected to two cycles in p^i{A^) and p^3{A^), but not 
by the same point in this cycle of p^2(A^). This gives for p^i{A^) p^2(A^) p^3(A^) 
a permutation of cycle type 

(/i^ N /i^ N p^){t,J, k) = (/i^ \ pD U (p2 \ ^2) y (^3 ^ ^3) ^ 1^1 ^ ^2 ^ ^3 _ 2}. 

and the corresponding cumulant is 

~ P{(fl^Ktl^){i,j))Ufl'-ii(^) — P{(fl'^MfJ.3){j,k))Utl^i^)- 

In this description, one can permute cyclically the indices 1, 2, 3, and this gives 3 
different graphs. 

(2) One cycle in p^2(A^) is connected to two cycles in pp_i{A^) and p^3(A^), and by 
the same point in this cycle of p^2[A'^). In other words, there is an identity 
al = af = a^. This gives again for p^i(A^) p^2(A^) p^3(A^) a permutation of cycle 
type (p^ N p^ N p^){i,j, k), but the corresponding cumulant takes now the form 

k{G) = 2p^lu^2u^3(w) + P(^lt^f^2Ktl'^){i,j,k)i(^) — P({m1n^2)(jj))u^3(c<;) 
~ P{(p^MfJ.3){j,k))Utl^ (W) — P{(p^Mfl'i)(i,k))Ufl'^ (W)- 

Here, there is no need to permute cyclically the indices in the enumeration for A^^. 

(3) Two distinct cycles in p^2(A^) are connected to a cycle of Pp^i{A^) and to a cycle 
of p^3(A^), which gives a permutation of cycle type 

(p^ N p2 N p''){l,j;k,l) = (p^\p,l)U(p2\{p2,p2})u(p3\p3)u{p,^+p2-l,p2+pf-l}. 
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The cuniulant corresponding to this last case is 

and again one can permute cychcally the indices 1,2,3 to get 3 different graphs. 
Consequently: 
Proposition 8.9. For any partitions fx, v and 5, the limit of n'^ K{Xp^ , Xp^ , Xpg) is 



Z/3Z \ i=l J=l fc=l 

^m) n^) ^(5) 

-P((/JBOl/)(i,j))Ui(^)-P((l'BOi)(fcJ))LlM(^) / 



i(ti) c(iy) tyo) 



_|_ \ ^ \ ^ \ ^ ,, ,, . X. f'^PtiUi^usi'^)+P(,j.x^x5)(i,j,k){'^)-P((iixv)(i,j))udi'^) 



i=i oyfc)=i 1=1 
eifi) e{u) i{5) 

^ ^ \ -P((iyxS)(j,k))Uii{'^)-P{{^xS)(i,k))U,yi'^) J ' 

i = l j=l k = l 

where Xlz/sz '^^^'^'^ ^^^^^ one permutes cyclically the partitions /i, i/, 5. /n particular, for 
cycles jj, = (k), v = (/) and 5 = (m), liian-^oo n"^ f^i^j^ , Xi , Xm ) is equal to 
klm[{k + l + m- l)pk,i,m{(^) + {k + I + m - 2) pk+i+m~2{uj) 

- (k + l-l) Pfc+/-l,m(w) - (/ + m - 1) Pl+m-l,k{^) -(l + m-l) Pk+m-l,li^)) ■ 

One recovers for k = 1 = m the values of cr^ and L announced in Theorem 8.3. 

8.3. Asymptotics of the random character values and partitions. Fix integer 
partitions fi^, . . . , fi^, and consider the random vector 

fj, H- 

The previous results show that the asymptotics of the multivariate generating series are 

where k'^*'-'^ = liTCin^oo n K{Xpl , Xpl) and k^*'-?'^) = lim„_j.oo'"'^ ^^(-^p i,-^p "L-^p 3) are the 
limiting quantities given in §8.2.2. So, one has mod-Gaussian convergence for every vector 
of random character values, and this ends the proof of Theorem 8.3 and Corollary 8.4. 
Note that the speed of convergence of the cumulants is each time a 0((q;„)~^) because 
of the polynomial behavior established in Lemma 8.5; therefore, one can indeed apply 
Corollary 3.6 with a„ = n and /3„ = n~^. 

These estimates on the distribution of the random character values are sharp but in one 
case: when u = ((0, 0, . . .), (0, 0, ...)). This parameter of the Thoma simplex corresponds 
to the Plancherel measures of the symmetric groups, and in this case, since ^2(1^) = 
^3(0;) = ■ ■ ■ = 0, the parameters of the mod-Gaussian convergence are all equal to 0. 
Indeed, the random character values under Plancherel measures do not have fluctuations 
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of order n~^/^. For instance, Kerov's central limit theorem (c/. [IO02]) ensures that the 
random character values 

^fk 
on cycles Ck of lengths k > 2 converges in law towards independent Gaussian variables; 
so the fluctuations are of order n"'^/^ instead of n~^/^. One still expects a mod-Gaussian 
convergence for adequate renormalizations of the random character values; however, the 
combinatorics underlying the asymptotics of Plancherel measures are much more complex 
than those of general central measures, see [Sni06a]. 

From the estimates on the laws of the random character values, one can prove many 
estimates for the parts Ai, A2, . . . of the random partitions taken under central measures. 
The arguments of algebraic combinatorics involved in these deductions are detailed in 
[FM12, Mell2], so here we shall only state results. Given a partition A = (Ai, . . . , A^) of 
size n, the Frobenius coordinates of A are the two sequences 

13 l\ / 1 3 1 

Ai - -, A2 - -, . . . , Ad - (i + - j , ( A'l - -, A2 - -, . . . , A'^ - d + - 

where A'^, Ag, etc. are the sizes of the columns of the Young diagram of A, and d is the size 
of the diagonal of the Young diagram. Denote (ai, . . . , a^), (61, ... , bd) these coordinates, 
and 

d d, 1 

ai ^ sr-^ bi 



i=l i=l 

This is a (random) discrete probability measure on [—1, 1] whose moments 



are also the moments of the Frobenius coordinates, so Xx encodes the geometry of the 
Young diagram A. We shall also need 

d d 

i=l i=l 

which will appear in a moment as the limit of the random measures Xx. Here, 7 = 

1 - E^i «i - E^i A- Notice that E„,^[xf )] = r^ick) = X^(x^-i). 

It is shown in [IO02] that for any partition A of size n and for any k, 

PkW = ^fc(A) + remainder, 

where the remainder is a linear combination of symbols E^ with |/i| < k. It follows that the 
cumulants of the pkS satisfy the same estimates as the cumulants of the SkS. Therefore, 
the rescaled random variable V(x'^~^) = n^^^ {Xx{x''~^) — X^{x''~^)) converges in the mod- 
Gaussian sense with parameters n^^^ cr^ and limiting function ip{z) = exp(L ^), where a^ 

and L are given by the same formula as in the case of the random character value XJ^'^ , 

that is to say 

a^ = p (p2fe-i(w) -Pkiujf); 

L = k^ ((3A; - 2)p3fc-2M - (6A; - 3)p2fc-iMPfcM + (3A; - l)pk{oof). 

Actually, one has mod-Gaussian convergence for any finite vector of random variables 
Vix^-^), k>l. 
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Now, there is a convenient way to rewrite the hmiting values of the second and third 
cuniulants. Denote T : C[x] — > C[x] the operator 

P(x) ^ —(xP(x)). 
ox 

Then, for any polynomial P, if V{P{x)) = n^/^ {Xx{P{x)) - X^{P{x))), then V(P(a;)) 
converges in the mod-Gaussian sense with parameters 

tn = n^/' (X^iTP)^) - X^{TP) X^TP)) (31) 

and limiting function exp(L y), with L equal to 

X^{T{{TPf)) - {X^iTPyf -3X^{TP) X^{T{{TPf)) + 3X^{T^P) {X^{TP)f . (32) 

The same holds for vectors of fluctuations (V(Pi(x)), . . . , 'W{Pr{x)))^ with limiting joint 
cumulants obtained from Equations (31) and (32) by "polarization": for instance, the 
limiting joint covariances are X^{TPi^ TPi^) — X^(TPjJ X^{TPi^). 

All this leads to the following result of mod-Gaussian convergence, which has a infinite- 
dimensional flavour. It is convenient to introduce 



A^'^) : C[x] ^ C[x\ 



gfc 

P ^ P^^- 
and for an operator O : C[x] — )■ C[x] or a linear form F : C[x] — > C, 



OW:0C[x]®^^C[a;] 



fc=0 



Pi®P2®---®Pk^ 0{Pl) X 0(P2) X ■ ■ ■ X 0{Pl 



k 



oo 



fc=0 
Pi ® P2 ® • • ■ ® Pfc ^ P(Pl) P(P2) ■ ■ ■ P(Pfc). 

Theorem 8.10. The fluctuations of random measures n^^'^^Xx — X^) converge in the 
mod-Gaussian sense with covariance operator 



■^n,LU 



n^/=^(X^)W o (tW o A(2) - A(2) o tW), 



and limiting functional 



^ = exp (^^^^Y^ ° {T^''^ - 3 id ® T^"^ + 3 id®^ ® T^"^ - id®^) o A^A . 

By that we mean that for any polynomial P = ZiPi{x) + 2:2 P2 (a;) + ■ ■ ■ + Zr Pr{x) in 
C[x][zi, . . . , Zr], one has the asymptotics 



E 



gn2/3(X;,{P)-X„(P)) 



exp(=4^)*(P)(l + o(l)). 



It is difficult to obtain from there moderate deviations for the parts Ai, A2, • • . of the 
partition; but the theorem gives at least a central limit theorem when one has strict 
inequalities ai > a2 > ■ ■ ■ > ai > ■ ■ ■ and /3i > /32 > ■ ■ ■ > (3i > ■ ■ ■ (see [Mell2], and 
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also [Bufl2]). Indeed, replacing the polynomial P by a smooth test function ipi equal to 
1 around ctj and to outside a neighborhood of this point, one has 

Xa(^.)-X^(^,) = --«. (33) 

n 

with probability going to 1, and Theorem 8.10 ensures that the quantities in the left-hand 
side renomalized by ^/n converge jointly towards a Gaussian vector with covariance 



n 



K{iJ) = X^iTipi Tifjj) - X^iTipi) X^iTipj) = 6ij Oi - aiOj. (34) 

So, the fluctuations 

V^ "i 

\ n 

of the rows of the random partitions taken under central measures P„^^ converge jointly 
towards a gaussian vector with covariances given by Equation (34), and one can include 
in this result the fluctuations 

of the columns of the random partitions, with a similar formula for their covariances. 

The reason why it becomes difficult to get by the same technique the moderate devi- 
ations of the rows and columns is that in Equation (33), one throws away an event of 
probability going to zero (because of the law of large numbers satisfied by the rows and 
the columns, see e.g. [KV81]). However, one cannot a priori neglect this event in com- 
parison to rare events such as {oj — nai > n"^^^ x}; indeed, these rare events are themselves 
of probability going exponentially fast to zero. Also, there is the problem of approxima- 
tion of smooth test functions by polynomials, which one has to control precisely when 
doing these computations. One still conjectures these moderate deviations to hold, and 
^2/3 ^Oj _ Q,^"^ |-Q converge in the mod-Gaussian sense with parameters n^^^{ai — of) and 
limiting function 

i>[z) = exp I z 

— this is what is given by Theorem 8.10 if we ignore the previous caveats, and still suppose 
the ai and Pj all distinct. As explained in [Mcll2], this would give moderate deviations 
for the lengths of the longest increasing subsequences in a random permutation obtained 
by generalized riffie shuffle. 



9. Appendices 

9.1. Properties of the kernels firp^d- We prove here Lemma 4.3. On the hypercube 

[— £,£:]'^, |-F(x — y) — P(x)| is smaller than md^'^ e, whereas on the complementary, the 
mass of the integral kernel is smaller than 

2d f°° 1 , 2d(2k + l) 

dx 



I{k) J, T^k+i^2k+2 I{k){Tef>'^ 
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It can be shown that I{k) is always a rational multiple of vr, decreasing with k — for 
instance, /(O) = vr and /(I) = ^. On [ 



12- ^■'^ L 2V2k+l' 2V2k+l'' 

1 — COs(x) 1 TT^ 1 / TT^ 

> ^^ T > - 1 



x2 -2 96(2A; + 1)-2V 24(2A; + 1) 

sin X TT^ 

> 1 



X - 24(2A; + 1)' 



So, 



1 2%/2fe+T /I — cosx\ /sina;\ 

2V2fc+l 

/ 2 \ 2fe+l 7]- ( 1 — Zif 



2V2A; + 1 V 2A{2k + l)J " 2v/2Fn 
and therefore, /(j__^_^jd^, A^*^^rf(x) rfx < ^0|wi-- Then, 

|/i?)(i^)-Mi^)l<l|A?L*i^-i^lloo 



< sup f / + / ) (|F(x - y) - F(x)| Ar,K^(y) rfy) 

,/ ,1/2 5Cc^(2fc + 1)3/2 \ 27 2fc+3 /Cm^'^+iX^ 



1 



by choosing on the last line the minimizer e = (^{5 C d^^"^ {2k + lY^'^)/(mT'^''^^)^^''+'^ of 
the previous bound, and simplifying a bit the constants. 

The second part of Lemma 4.3 relies on the following arguments. Denote ^t the 
operator on "^"^(IR) defined by 

■^Tf{x) = —j f{y)dy. 

The image of ^'^(M) by ^t is included into ^'^'^^(M), and on the other hand, if / is 
supported by [a,b], then ^tI is supported by [a — T,b + T]. At the level of Fourier 
transforms, one has: 

{^Tf){x) = ^ £^^ m dy = ^l^ f{z) (£^^ e-y dy^ dz 

1 z 

with gxix) = '^'^^^ . As a consequence, if / is a non-negative function with Fourier 
transform supported by [—T,T], then {qtY^ f is a non- negative function with Fourier 
transform of class ^^fc ^nd supported by [—{2k + 1)T, {2k + 1)T]. However, 

Ai°)(x) = ii^.?^ ^ ^\x ^' '■" 

TT 



T y-> „ y^2 --TV-. > ' J. 



which is supported by [—T^T] and with values in [0, 1]. So, the Fourier transform of Aj, 



takes its values in [0, 1], is supported by [—{2k + 1)T, {2k + 1)T] and is of class ^'^^. The 



result for A^iL^ follows immediately. 



MOD> CONVERGENCE AND PRECISE DEVIATIONS 



75 



9.2. Gaussian regularity of convex bodies. Let S be a convex body in MJ^ containing 
in its interior, liomeomorpliic to a closed ball and with a boundary dB which we assume 
to be Lipschitz in the following sense: there exists a constant L > such that the boundary 
dB is parametrized by an homeomorphism x '■ S'^^^ -^ dB with 



xid) e R+e 



\\xiO)-xiO' 



\x 



< L\\9-9'\ 



Notice that the constant L does not change if one renormalizes B and looks at XB instead 
oi B, X > 0. In other words, L only depends on the form of the convex body viewed from 
the origin. We refer to [Sch93, Hor94] for references on convex bodies. We claim that 
under these assumptions, B is Gaussian regular with constant 4(1 + 2L)^. As will be clear 
from the proof, the origin only plays a role of reference point, so the same holds for any 
convex body with boundary L-Lipschitz viewed from a given point in the interior of B. 



First, for any direction 6 G S'^"^, we claim that the intersection of M^^ with B'^\B is an 
interval. Indeed, if two points a, b on R+6 are in 5^ \ B, then there exists corresponding 
points c, d in 5 with ||a — c|| < e and ||b — d|| < e, and then by convexity of B all the 
points between a and b are in i?^. So, there exists real numbers < a < f3 such that 



{B'\B)r]R+9 = {a9,l3e], 



and in fact one has necessarily a6 G dB, and ad 
X{0'), so 



/3 



a 



X 



< 



- x{0')\\ + \\x{0') 



x{0)- Then 

X{0)\\<e + 



L 



is e-close to some 



\9-9' 




Figure 14. The intersection of an half-line R^6 with B^ \B, B convex 
body is a segment whose length is bounded by a multiple of e if G -B° and 
dB is Lipschitz. 



However, for two points a and b on two lines R^9 and IR+6'', one has always ||a - 
-||^ — ^'11, the worst case being when 6 = —9' and b is near zero. Consequently, 



bll > 



(3-a<e + 2L 



\X 



\\m 



\xie')-m\<i^L + l)e. 
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Then, one writes: 

1 r iixii2 , 1 



(27r)^/2 



(ix 



B^\B 



< 



(27r)^/2 



(27r)'^/2 
< {2L + l)e 







^ dr) dfigd-i{6) 



a+{2L+l)s 



Q 2 f 



d-l 



dr 



2i-rf/2 

— T^rrr niax | e 
T{d/2) r>o 



2 f 



d-l 



<2{2L + l)e 



by using Stirling estimates. 

Similarly, we claim that for any direction 9 G S'^^^, the intersection of M_|_6' with B\B~^ 
is an interval. Fix e, 77 > 0, a direction 9 and consider the set 

/,,^,e = {x G fi n R0 I fi(,,,+„)Cfi}. 

This is a convex compact subset of M0, as shown by Figure 15. However, B^^ = {{B^)'^Y = 
|J^q{x G B I -B(x,E+r?) C -B}, so M.9 n -B^^ is the increasing union of the segments le^nfi- 
Therefore, there exists real numbers < 7 < a such that 

{B\B-')r\R+9= [-f9,a9], 

possibly with 7 = 0. As before, a9 = x(6'). Denote 9' a direction such that 7^ is £-close 
to a point of M.^9' which is outside B; one can assume without loss of generality that this 
point is on the boundary of B, that is to say that \\'y9 — x(6'')|| < e. Again, we want to 
bound a — 7 by a constant times e. 




Figure 15. The intersection of a line M.9 with B ^ = {{B'^YY is the union 
of all sets Ie,ri,ei ''7 > 0, and these sets are nested segments. 



• If 7 > 4L e, then one has 

« - 7 = MO) - io\\ < urn - io\\ + um - X 



<e + L\\x 



\9-9'\ 



The second term in the right-hand side is smaller than L a -||7^ — x{0') 



< 



2Lea 

7 ' 



SO 



a 



7 <£ + 



2Lea 

7 



R<1 + 



{2L + l)e 



7 



2Le 
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with R = -. Thus, a < 7 + (2L + 1)£: _'^^^ , and by the hypothesis 7 > 4Le, this 
leads to a -7 <2{l + 2L)e. 

• Conversely, if 7 < ALe, then ||x(^')ll < (4-^ + 1) £, so 

« - 7 = llxW - 7^11 < \\x{0') - iO\\ + llx(^') - xm <e + L \\x{e')\\ \\e - e'W 

< (1 + 2L(4L + 1)) e < (SL^ + 4L + 2) £. 
So in any case, a — 7 < 2(1 + 2L)'^ e. 

Using then spherical coordinates to compute the integrals, we obtain the bound 

Hence, we have indeed proved that convex bodies with boundary Lipschitz with constant L 
viewed from a certain point of their interior are Gaussian-regular with constant 4(H-2L)^. 
Now, it should be noticed that any convex body B with non-empty interior has a Lipschitz 
boundary for a certain constant L and with respect to a certain reference point z in B°. 
However, this constant L may be extremely big if one does not choose z correctly {e.g. 
near the boundary of B). On the other hand, it follows from [BRIO] that one can in fact 
choose a constant G of Gaussian regularity that works for every convex body of M'', but 
the proof of this relies then on more complex arguments than before, and is much longer. 
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